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(57) Abstract 

A non-naturally occurring variant of a C-terminal fragment of a Plasmodium merozoite surface protein- 1 (MSP-1) wherein said 
variant has (i) a reduced affinity, compared with a naturally occurring Plasmodium MSP-1 19, for at least one first antibody capable f 
blocking the binding of a second antibody, which second antibody inhibits the proteolytic cleavage of Plasmodium MSP-1 42 and (ii) 
substantially the same affinity for at least one third antibody compared with said naturally occurring Plasmodium MSP-1 19, which third 
antibody inhibits the proteolytic cleavage of Plasmodium MSP-1 42 is provided for use in an anti-malarial vaccine. 
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VACCINE 

Field of the Invention 

5 The present invention relates to modified Plasmodium MSP-1 protein variants and their 
use in producing a vaccine against malaria. It also relates to a method for the rational 
design of suitable variants. 

Background to the Invention 

10 

Malaria is a devastating disease that causes widespread morbidity and mortality in areas 
where it is transmitted by anopheline mosquitoes. In areas of high transmission young 
children and non-immune visitors are most at risk from this disease, which is caused by 
protozoa of the genus Plasmodium. In areas of lower or unstable transmission, epidemics 
15 of the disease can result and afflict individuals of all ages. The most dangerous form of 
malaria, responsible for much of the morbidity and most of the mortality, is caused by the 
species Plasmodium falciparum. It has been estimated that 2 billion people are at risk 
from malaria, with 200-300 million clinical cases and 1-2 million deaths each year. 

20 The parasite has a complex life cycle in its human and mosquito hosts. In humans the 
stage of the life cycle which is responsible for the clinical symptoms of the disease occurs 
in the bloodstream. During this phase the parasite is largely hidden within host red blood 
cells. Here the parasite grows and multiplies. For example, within a red blood cell each 
P. falciparum parasite divides several times to produce approximately 20 new ones during 

25 a 48 hour cycle. At this point the red blood cell is burst open and the parasites (called 
merozoites at this stage) are released into the bloodstream. The merozoites must enter new 
red blood cells in order to survive and for the cycle of replication in the blood to continue. 
If the parasites do not manage to enter red blood cells they cannot survive for very long 
and are rapidly destroyed. Symptoms of malaria such as fever are associated with this 

30 cyclic merozoite release and re-invasion of red blood cells. 
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There is an urgent need for a vaccine against malaria. There is no effective vaccine 
currently available. In addition, mosquito control- by the spraying of residual insecticides 
is either becoming ineffective or considered to be unacceptable, and there is a very 
worrying spread of drug resistance within parasites. The rapid spread of drug resistance is 
worrying because compounds such as the cheap and once-effective chloroquine are no 
longer useful in many parts of the world, and there are few if any new drugs available that 
are both cheap and effective. Vaccines against microorganisms can be very cost effective 
and efficient ways to protect populations against infectious diseases. 

Because of the complexity of the parasite's life cycle there are a number of points in its 
development within humans that could be the target of a protective immune response. It 
is known that with increasing age and exposure individuals do become immune to 
malaria, suggesting that protective responses do develop with time. Broadly speaking 
there are three types of vaccine strategy: to target the pre-erythrocytic stages, the asexual 
blood stage and the sexual stage. The pre-erythrocytic stages are the sporozoites that are 
injected by an infected mosquito when it takes a blood meal and the initial development of 
the parasite in the liver. The asexual blood stage is the infection and release of merozoites 
from red blood cells that occurs in a cyclic manner, and the stage responsible for the 
manifestation of the clinical symptoms. The sexual stage takes place in the mosquito's 
gut after it has ingested gametocytes in a blood meal and this initiates the infection of the 
insect to complete the cycle; a vaccine against the sexual stages would not protect the 
individual but could reduce transmission and therefore the incidence of malaria in a given 
human population. 

During the asexual cycle in the blood the parasite is directly exposed to the host's immune 
system, and in particular to antibodies circulating within the bloodstream, only transiently: 
when merozoites are released by rupture of one cell and before they penetrate another. If 
there are specific antibodies that can bind to the surface of the parasite then it is possible 
that these antibodies will interfere with the ability of the parasite to invade a new red 
blood cell. In fact it has been shown that several monoclonal antibodies that recognise 
single epitopes on parasite surface proteins, are capable of neutralising the parasite and 
preventing the cycle of reproduction within red blood cells. 
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One of the best characterised proteins on the surface of the merozoite is called merozoite 
surface protein 1 (MSP- 1). MSP- 1 is a large protein that varies in size and amino acid 
sequence in different parasite lines. It is synthesised as a precursor molecule of -200 kDa 
5 by the intracellular parasite and located on the parasite's surface. During release of 
merozoites from red blood cells and the re-invasion of new erythrocytes the protein 
undergoes at least two proteolytic modifications. In the first modification as a result of a 
process called primary processing, the precursor is cleaved to four fragments of -83, 30, 
38 and 42 kDa that remain together as a complex on the merozoite surface. This complex 

10 also contains two other proteins of 22 kDa and 36 kDa derived from different genes. The 
complex is maintained by non-covalent interactions between the different subunits and is 
held on the merozoite surface by a glycosyl phosphatidyl inositol anchor, attached to the 
C-terminus of the 42 kDa fragment and inserted into the plasma membrane of the 
merozoite. At the time of merozoite invasion of an erythrocyte the C-terminal 42 kDa 

15 fragment is cleaved by a second proteolytic cleavage in a process called secondary 
processing. The result of secondary processing is that the entire complex is shed from the 
surface of the merozoite except for a C-terminal sub-fragment that consists of just under 
one hundred amino acids and which is carried into the newly invaded erythrocyte on the 
surface of the merozoite. 

20 

Based on sequence similarities, the structure of this small C-terminal fragment (called 
MSP- 1 19) was suggested to consist of two epidermal growth factor (EGF)-like domains 
(see sequence in Figure 1) (Blackman et at., 1991). An EGF-like motif consists of a 45-50 
amino acid sequence with a characteristic disulphide bonding pattern and such domains 

25 occur frequently in extracellular modular proteins of animals. In the MSP-1 C-terminal 
fragment each of the motifs contains six Cys residues proposed to form three disulphide 
bonds and each motif has a partial match to the EGF consensus (see Figure 1 ). However, 
because the degree of similarity is limited and since the pattern of its disulphide bonding 
is not known, the designation of the MSP-1 C-terminal fragment as comprised of EGF- 

30 like structures has been regarded as tentative. Other relatively divergent potential EGF- 
like sequences occur in Plasmodium proteins, but previous structure determinations have 
been confined to those from metazoan organisms (Campbell et aL, 1998). 
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A number of studies have implicated MSP-1 as the target of a protective immune 
response. Although the goal of this work is to develop a malaria vaccine for use in 
humans, out of necessity most of this experimental work has been done either in model 
5 animal systems or in vitro. These include studies of the effect of specific antibodies on 
parasite invasion of erythrocytes in vitro, passive immunisation studies in rodent malaria 
models in laboratory mice and direct immunisation in both rodent and primate malaria 
models using either native protein (derived from the parasite) or recombinant protein 
expressed from parts of the MSP-1 gene in heterologous organisms. Sero-epidemiological 

10 studies have also showed a correlation between human antibody responses to parts of the 
MSP-1 molecule and protection against clinical disease. Much, but not all, of the work 
has focused on the immune response to the C-terminal MSP-1 19. For example some 
monoclonal antibodies that recognise MSP-1 19 prevent red blood cell invasion in in vitro 
cultures (Blackman et al.* 1990). Interestingly, these antibodies that inhibit invasion also 

15 inhibit the secondary processing of the 42 kDa fragment, suggesting the mechanism by 
which they work is by steric hinderance of the protease responsible for secondary 
processing (Blackman et ai, 1994). Since secondary processing goes to completion 
during successful invasion, if it cannot occur then invasion is interrupted. 

20 All of the work described above would suggest that MSP-1 and in particular polypeptides 
based on the C-terminal sequence that forms the 42 kDa or the MSP-1 19 region, should be 
very good candidates for malaria vaccine development. However, several studies have 
shown that the epitopes or binding sites for antibodies on MSP-1 19 require a correct 
polypeptide tertiary structure, and that this is destroyed by treatments that reduce the 

25 disulphide bonds that are postulated to be present between the cysteine residues present in 
MSP-1 19. This limitation appears to have been overcome by the expression of 
recombinant protein in ways that allow antibodies that recognise the native parasite MSP- 
1 to bind. Other investigators have suggested that other parts of MSP-1 also have 
potential for inclusion in a vaccine, however the MSP-1 C-terminal fragment is currently 

30 the lead candidate for development of a vaccine against the blood stages of the malaria 
parasite (Diggs era/., 1993; Stoute et a/., 1998). 
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As stated above, every -48 hours P. falciparum merozoites are released from the infected 
erythrocyte to re-invade new red blood cells and during this time they are exposed to the 
host's immune system. Therefore, the question arises as to how the parasite has evolved to 
avoid the potentially lethal effects of, for example, neutralising antibodies. In other 
5 infectious micro-organisms it is clear that there is a constant battle between the immune 
system and the micro-organism, and that sophisticated mechanisms have been evolved by 
micro-organisms to evade the immune response. For example antigenic variation and 
antigenic diversity are two mechanisms that involve presenting the immune system with 
;i a moving target" such that even though an immune response to one variant of the micro- 

10 organism may kill that variant, new variants are produced that are at least partially or fully 
resistant to the immune response. In the case of malaria merozoites and in particular 
MSP-1 an alternative mechanism has been proposed whereby the binding of some 
antibodies ("blocking antibodies") can prevent the binding of neutralising antibodies and 
thereby allow the parasite to successfully invade a red blood cell even in the presence of 

15 neutralising antibodies (Guevara Patino et aL, 1997). These blocking antibodies may be 
of two types, those against epitopes that are formed from amino acids that are distant in 
the linear primary sequence from the epitopes that are the target of neutralising antibodies, 
and those that are against epitopes that overlap with the epitopes of the neutralising 
antibodies. This represents a novel mechanism by which a parasite can evade an effective 

20 .. immune response, and unlike mechanisms based on antigenic polymorphism or diversity, 
it is not dependent upon amino acid sequence diversity. 

Some monoclonal antibodies (mAbs) that bind to MSP-1 19 inhibit the proteolytic cleavage 
and erythrocyte invasion, suggesting that cleavage is a prerequisite for invasion 

25 (Blackman et al. y 1994). Other mAbs that bind to the MSP-1 C-terminal fragment do not 
inhibit processing or invasion but block the binding of the inhibitory neutralizing 
antibodies. Other antibodies that bind to MSP-1 19 neither inhibit nor block the binding of 
inhibitory antibodies. In the presence of blocking antibodies, inhibitory antibodies are 
ineffective and invasion proceeds. The balance between inhibitory and blocking 

30 antibodies induced by immunisation may be a critical factor in determining whether or not 
the immune response is effective in preventing invasion (Guevara Patino et a/., 1 997). 
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Summarv of the Invention 

An object of the present invention is therefore to provide an effective vaccine against the 
malaria parasite based on variants of the Plasmodium MSP-1 protein. In designing such a 
5 vaccine, the following criteria should be met: 

1. The amino acid sequence of the polypeptide to be used in the vaccine should 
contain epitopes that are the targets of, and can induce, neutralising antibodies. 

10 2. The polypeptide should ideally not include amino acid sequences that only form 
epitopes for blocking antibodies. 

3. If the polypeptide contains epitopes for both neutralising and blocking antibodies 
then it should be modified to remove the blocking antibody epitopes without affecting the 
1 5 neutralising epitopes. 

To assist in the design of candidate vaccine polypeptides fulfilling these three criteria, it is 
important to determine the three-dimensional structure of the MSP-1 C -terminal fragment 
since this will help in mapping sites of antibody interactions with this fragment. We have 
20 therefore determined the solution structure of the MSP-1 C -terminal, including the pattern 
of disulphide bonding, using NMR techniques. 

We have made amino acid substitutions in the sequence of MSP-1 19 that prevent the 
binding of individual blocking monoclonal antibodies, without affecting the binding of 

25 neutralising antibodies. By determining the 3-dimensional structure of MSP-1 19 we have 
identified where these antibody binding sites are located in the tertiary structure and this 
has allowed other amino acid substitutions to be made that have similar properties. We 
have shown that several substitutions, each affecting the binding of one or more blocking 
antibodies can be combined into a single molecule, and that these modified molecules 

30 continue to bind the neutralising antibodies but fail to bind any of the blocking antibodies. 
Such modified molecules are expected to. be much more effective than the natural or wild- 
type protein structure at inducing a protective neutralising antibody response when used to 
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immunise individuals as a malaria vaccine. In addition we have made other modifications 
in the primary structure of the molecule which do -not affect the binding of the neutralising 
antibodies but which may contribute to increased immunogenicity of the molecule. The 
modified MSP- 1 19 structures, either alone or coupled to other carriers, which may or may 
5 not contain other parts of MSP-1 to enhance the immunogenicity (for example a 
combination of the remainder of the MSP-1 42 kDa fragment with the modified MSP-1 19) 
and provide additional T cell epitopes, would be more effective vaccines than equivalent 
structures that have not been modified in this way. 

10 Accordingly, the present invention provides a non-naturaily occurring variant of a 
C-terminal fragment of a Plasmodium merozoite surface protein- 1 (MSP-1) wherein said 
variant has (i) a reduced affinity, compared with a naturally occurring Plasmodium 
MSP-1 19, for at least one first antibody capable of blocking the binding of a second 
antibody, which second antibody inhibits the proteolytic cleavage of Plasmodium 

15 MSP-I42 and (ii) substantially the same affinity for said second antibody compared with 
said naturally occurring Plasmodium MSP-1 19. 

Preferably, the Plasmodium MSP-1 19 and MSP-I42 are Plasmodium falciparum MSP-1 19 
and MSP-1 42. 

20 

The first antibody is preferably selected from mAbs IE1, 2.2, 7.5, 9C8 and 111.4. The 
second antibody is preferably selected from mAbs 12.8, 12.10 and 5B1. 

The present invention further provides a non-naturally occurring variant of a C-terminal 
25 fragment of a Plasmodium merozoite surface protein- 1 (MSP-1) comprising an amino acid 
modification at any one of amino acid residues 14, 15, 27, 31, 34 , 43 48 and 53 of the 
Plasmodium falciparum MSP-1 19 amino acid sequence shown as SEQ I.D. No. 1 or their 
equivalent positions in other Plasmodium MSP-1 19 polypeptides. 

30 Preferably said modifications are substitutions selected from Gin 14— »Arg, Glnl4-»Gly ? 
Asnl 5-*Arg, Glu27->Tyr, Leu3 1 ->Arg, Tyr34-»Ser, Tyr34->Ile, Glu43-»Leu ? 
Thr48->Lys and Asn53— >Arg and their equivalents in other Plasmodium MSP-1 !9 



MSOOCID: <WO 0063245A2_I_> 



WO 00/63245 



PCT/GB00/01558 



-8- 

polypeptides. More preferably said substitutions are combinations of substitutions 
selected from [Glu27-»Tyr. Leu31— >Arg and Glu43^Leu] ? [GIu27— >Tyr. Leu31-*Arg, 
Tyr34— >Ser and Glu43— »Leuj. [Asnl5— ►Arg, Glu27-^Tyr ; . Leu3 1 Arg and 
Glu43^Leu] and their equivalents in other Plasmodium MSP-1 19 polypeptides. 

5 

In a preferred embodiment, a variant MSP-1 polypeptide of the invention further 
comprises a mutation at Cysl2 and/or Cys28 of the Plasmodium falciparum MSP-1 1 9 
amino acid sequence shown as SEQ LD. No. 1. Preferably such modifications are 
substitutions selected from Cysl2— *Ile and Cys28— "Tip, and Cysl2— *Ala and 
10 Cys28->Phe. 

Most preferably the substitutions are combinations selected from [Cysl2— >IIe, Asn 
15— Arg, Glu27— Tyr, Cys28-*Trp, Leu31—Arg, Glu43— >Leu], [Cysl2— He, Asn 
15— Arg, Glu27— >Tyr, Cys28— Trp, Leu31— Arg, Glu43— Leu, Asn53— Arg], 
15 [Cysl2— He, Asn 15— Arg, Glu27-— Tyr, Cys28— Trp, Leu31— Arg, Tyr34— Ser, 
Glu43— >Leu, Asn53— Arg] and their equivalents in other PlasmodiumMS?-\\? 
polypeptides. 

The present invention also provides a method for producing a Plasmodium MSP-1 variant 
20 for use in preparing a vaccine composition which method comprises modifying one or 
more amino acid residues of a Plasmodium MSP-1 C-terminal fragment such that the 
resulting derivative has (i) a reduced affinity, compared with a naturally occurring 
Plasmodium MSP-1 19, for at least one first antibody capable of blocking the binding of a 
second antibody, which second antibody inhibits the proteolytic cleavage of Plasmodium 
25 MSP-1 42 and (ii) substantially the same affinity for said second antibody compared with 
said naturally occurring Plasmodium MSP-1 19. In particular the method of the invention 
preferably comprises as a preliminary step, selecting a candidate amino acid residue by 
reference to a three dimensional NMR model structure, preferably as set out in Table 2. 
More specifically, the 3D model structure is used to select a surface exposed amino acid 
30 residue; Advantageously, a further step is included of computer modelling the three 
dimensional structure of the variant to exclude polypeptides that do not fold correctly. 
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The present invention also provides a non-naturally occurring Plasmodium MSP-1 variant 
obtained by the method of the invention. 

In a further aspect, the present invention provides a polynucleotide encoding a variant of 
the invention operably linked to a regulatory sequence capable of directing the expression 
of said nucleotide in a host cell. The polynucleotide may comprise a sequence which has 
been optimised for expression in the host cell. The host cell may be a Pichia pastoris cell. 
Also provided is a nucleic acid vector comprising a polynucleotide of the invention, 
including viral vectors, and a host cell comprising a nucleotide or vector of the invention. 

In another aspect, the present invention provides a pharmaceutical composition 
comprising a variant of the invention, a polynucleotide of the invention or a vector of the 
invention together with a pharmaceutically acceptable carrier or diluent. 

Preferably, the composition further comprises an immunogenic Plasmodium polypeptide 
or fragment or derivative thereof such as MSP-1 33 or a fragment or derivative thereof 
which may be covalently attached to the non-naturally occuring MSP-1 19. It is preferred 
not to use wild-type MSP-1 19 sequences. The further immunogenic peptide may itself be 
derivatised in an analogous manner as described above for MSP-1 19. Thus, epitopes 
present in the peptide may be identified and modified to prevent binding of blocking 
antibodies, without affecting the binding of neutralising antibodies. These epitopes may be 
capable of binding to antibodies which have similar properties to the first antibody 
described above, for example, in binding affinity. The further immunogenic peptide may 
comprise several such modifications in its amino acid sequence. 

The present invention also provides a method for producing anti-MSP- 1 antibodies which 
method comprises administering a polypeptide variant of the invention, or a 
polynucleotide of the invention or a vector of the invention to a mammal, typically a non- 
human mammal. 

In a preferred embodiment, the present invention provides a method for producing 
polyclonal anti-MSP- 1 antibodies which method comprises administering a polypeptide 



WO 00/63245 PCT/GBOO/01558 

-10- 

variant of the invention, or a polynucleotide of the invention or a vector of the invention to 
a mammal, typically a non-human mammal, and extracting the serum from said mammal. 
Also provided is an antibody produced by the said methods. 

5 The polypeptides, nucleotides and vectors of the present invention may be used in 
methods of treating and/or preventing malaria caused by Plasmodium species, in particular 
Plasmodium falciparum. Accordingly, the present invention provides a method of 
inducing immunity against malaria induced by Plasmodium falciparum which comprises 
administering to a person in need of such immunity an effective amount of a variant, a 
10 polynucleotide or a vector of the invention. 

Also provided is a method of immunizing a mammal, said method comprising 
administering an effective amount of a variant, a polynucleotide or a vector of the 
invention. In particular, said mammal is immunized against malaria. Preferably the 
15 mammal is a human. 

The present invention also provides a method of treating a malaria infection in a human 
patient which comprises administering to the patient an effective amount of the 
pharmaceutical composition of the invention. 

20 

We further provide according to the present invention a nucleic acid encoding a 
Plasmodium MSP-1 polypeptide, in which the nucleic acid is optimised for expression in 
a heterologous host cell.. Preferably, the heterologous host is a Pischia pastoris cell. The 
MSP-1 polypeptide may be selected from the group comprising an MSP-I42 polypeptide 

25 comprising a sequence shown in Figures 2C and 2E. an MSP-1 19 polypeptide comprising 
a sequence shown in Figure 2C, and an MSP-1 33 polypeptide comprising a sequence 
shown in Figure 2E. The optimised nucleic acid may comprise a sequence selected from 
the sequences of Figure 2A, Figure 2B and Figure 2D. We further provide a vector 
comprising such a nucleic acid, a host cell comprising such a vector, and a pharmaceutical 

30 composition comprising such a nucleic acid or a vector, together with a pharmaceutically 
acceptable carrier or diluent. The pharmaceutical composition may further comprise an 
immunogenic Plasmodium polypeptide or fragment or derivative thereof. 
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Detailed description of the invention 

Although in general the techniques mentioned herein are well known in the art, reference 
5 may be made in particular to Sambrook et aL, Molecular Cloning, A Laboratory Manual 
(1989) and Ausubel et aL, Current Protocols in Molecular Biology (1995), John Wiley & 
Sons, Inc. 

A. MSP- 1 variant polypeptides 

10 

The variant MSP-1 polypeptides of the present invention will be described with reference 
to Plasmodium falciparum MSP-1 amino acid sequences. However, it should be 
appreciated that except where otherwise stated, all references to MSP-1 polypeptides 
include homoiogues of MSP-1 found in other Plasmodium species, such as P. vzvox, 
15 P. malariae and P, ovale which all infect humans and P. yoelii which infects mice. 

The variant MSP-1 polypeptides of the present invention are based on C -terminal 
fragments of the Plasmodium falciparum MSP-1 42 polypeptide shown as SEQ I.D. Nos. 2 
or 3. Such polypeptides will comprise some or all of the MSP-1 19 region (SEQ I.D. No. 
20 1), preferably at least substantially all of the domain 1 and/or domain 2 EGF-like 
sequences found in MSP-1 19 (approximately amino acids 1-47 and amino acids 48-96, 
respectively, of SEQ I.D. No. 1). It is particularly preferred to use regions that are 
conserved in most, more preferably all parasites of a single species to increase the 
effectiveness of the variant as a vaccine against a wide range of strains. 

25 

Variant MSP-1 polypeptides of the present invention comprise modifications to their 
primary amino acid sequence that reduce the ability of blocking antibodies to bind to the 
MSP-1 polypeptides. In addition, any modifications made should maintain epitopes 
recognised by neutralising antibodies such that the affinity of the neutralising antibodies 
30 for the MSP-1 variant is substantially the same as for naturally-occurring MSP-1 
polypeptides (such as an MSP-1 42 polypeptide having the sequence shown in SEQ I.D. 
Nos. 2 or 3). Some reduction in the binding of some neutralising antibodies may be 
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tolerated since the primary objective is to inhibit the binding of blocking antibodies and it 
is likely that an effective reduction in the bindingof blocking antibodies will compensate 
in terms of overall vaccine efficacy for a small reduction in neutralising antibody binding. 

5 Neutralising antibodies in the context of the present invention are antibodies that inhibit 
malaria parasite replication. A variety of neutralising antibodies, polyclonal and 
monoclonal, are known in the art, including mAbs 12.8. 12.10 and 5B1 referred to in the 
Examples. The activity of neutralising antibodies can be determined in a variety of ways 
that have been described in the an. For example, a convenient assay method described in 
10 Blackman et a/., 1994 involves using preparations of merozoites (Blackman et al., 1993; 
Mrema et aL. 1982) to measure cleavage of MSP-I42 into MSP-I33 and MSP-1 19. Briefly, 
freshly isolated merozoites are washed in ice-cold buffer and divided up into aliquots of 
about 2xl0 9 merozoites. A test antibody is added to each aliquot and the sample 
incubated at 37°C for 1 hour. The samples are then subjected to SDS-PAGE under non- 
15 reducing conditions on a 12.5% polyacrylamide gel, Western blotted and the blot probed 
with antiserum to MSP-1 33. In the control sample, two main bands are seen - one 
corresponding to MSP-1 42 and one lower molecular weight band corresponding to 
MSP-1 33. Neutralising antibodies will reduce the amount of the lower molecular weight 
band as a result of inhibiting secondary proteolytic processing of MSP-1 42. 

20 

This method is a particularly preferred method for assessing the efficacy of neutralising 
antibodies in the presence of antibodies believed to act as blocking antibodies. Where 
candidate competing blocking antibodies are to be tested, the merozoite sample is 
preincubated with a blocking antibody for 15 mins on ice prior to incubation with a 
25 neutralising antibody at 37°C for 1 hour as described above. Thus blocking antibodies can 
readily be identified and/or characterised using such an assay method. 

Other assay methods include merozoite invasion inhibition tests as described in Blackman 
etal., 1990. 

30 

As discussed above, blocking antibodies are defined in the context of the present 
invention as antibodies that inhibit the binding of neutralising antibodies to MSP-1 but 
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which do not themselves inhibit invasion of red blood ceils by malaria parasites. Thus 
they "block" the neutralising function of the neutralising antibodies. A variety of blocking 
antibodies have been characterised in the art. including mAbs IEL 2.2, 7.5 and 111.4 
referred to in the Examples. As discussed above, blocking antibodies can conveniently be 
5 identified and/or characterised using assays that test their effect on neutralising antibody 
function. 

Modifications that may be made to produce MSP-1 variants of the invention include 
substitutions, deletions and insertions. It is particularly preferred to use substitutions to 
minimise disruption of the secondary/tertiary structure of the polypeptide. Furthermore, 
particularly preferred substitutions are those that replace one class of amino acid with 
another class, such as an aliphatic non-polar residue with a charged polar residue. For 
example, the twenty naturally occurring amino acids may be divided into four main 
groups (aliphatic non-polar [G, A, P, I, L and V], polar un-charged [C, S, T, M, N and Q], 
polar charged [D, E, K and R] and aromatic [H, F, W and Y]) and it is preferred to replace 
an amino acid from one group with an amino acid from another group. 

Other possibilities include replacing a positively charged side chain with a negatively 
charged side chain, replacing an amino acid with a large side chain with an amino acid 
20 with a smaller or no side chain (glycine), replacing a polar amino acid with a charged 
polar amino acid, replacing a large aromatic amino acid with an amino acid with a small 
side chain, and replacing cysteine residues that are involved in disulphide bonds. 

Particularly preferred modifications are an amino acid modification at any one of amino 
acid residues 14, 15, 27, 31, 34 , 43, 48 and 53 of the Plasmodium falciparum MSP-1 !9 
amino acid sequence shown as SEQ I.D. No. 1 or their equivalent positions in other 
Plasmodium MSP-1 19 polypeptides. These residues are all almost within the EGF-like 
domain 1 . It is known that the epitopes of some antibodies contain amino acid sequences 
that are within EGF-like domain 2, therefore equivalent modifications may also be made 
in EGF-like domain 2. Preferred examples of modifications include the following 
substitutions Glnl4-*Arg, Glnl4-»Gly, Asnl5-»Arg, Glu27-»Tyr ? Leu3 1 — > Arg, 
Tyr34->Ser, Tyr34->Ile, Glu43— >Leu. Thr48-*Lys and/or Asn53->Arg and their 
equivalents in other Plasmodium MSP-1 19 polypeptides. 
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It is especially preferred to carry out more than one modification, i.e. to use combinations 
of modifications, such as two or more or three or more. In a preferred embodiment, an 
MSP-1 variant of the invention comprises a combination of amino acid substitutions 
5 selected from [Glu27-»Tyr. Leu31-*Arg and Glu43— >Leu] ? [Glu27— >Tyr, Leu31-*Arg, 
Tyr34->Ser and Glu43-»Leu], [Asnl5— >Arg, Glu27^Tyr, Leu31->Arg and 
Glu43— >Leu] and their equivalents in other Plasmodium MSP-1 1 9 polypeptides. 

A particularly preferred combination further comprises a modification to Cysl2 and/or 
10 Cys28 (and/or their equivalent residues in EGF-like domain 2) to disrupt the disulphide 
bond. Preferably such modifications are substitutions selected from Cysl2— ►He and 
Cys28— Trp, and Cysl2—Ala and Cys28— * Phe. 

Most preferably the substitutions are combinations selected from [Cysl2— *Ile. Asn 
15 15-+Arg, Glu27— Tyr, Cys28— Trp, Leu31-*Arg, Glu43-+Leu], [Cysl2-> He, Asn 
15— Arg, Glu27— Tyr, Cys28— Trp, Leu31—Arg, Glu43—Leu, Asn53— Arg], 
[Cysl2— lie, Asn 15— Arg, Glu27— Tyr, Cys28— Trp, Leu31— Arg, Tyr34— Ser, 
Glu43— *Leu, Asn53— *Arg] and their equivalents in other Plasmodium MSP-1 19 
polypeptides. 

20 

Substitutions are not confined to using naturally occurring amino acids - non-naturally 
occurring amino acid analogues may also be used, in particular where solid phase 
synthesis is to be used to chemically synthesise the variant, as opposed to recombinant 
technology. 

25 

Modifications to MSP-1 amino acid sequences may be carried out using standard 
techniques such as site-directed mutagenesis using the polymerase chain reaction. 
Alternatively, variants may be obtained by solid phase synthetic techniques. 

30 To determine whether a variant MSP-1 polypeptide produced by modification of its 
primary amino acid sequence complies with the criteria specified above, the affinity of at 
least one neutralising antibody and at least one blocking antibody for the variant 
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polypeptide compared with the naturally occurring MSP-1 sequence may be tested. 
Ideally more than one of each type of antibody should be used, for example two or three. 



The ability of antibodies to bind to the variant and wild-type polypeptides may be 
5 determined using any one of a variety of methods available in the an for determining 
antibody-epitope binding. One such method, described in the Examples, involves the use 
of MSP-1 sequences expressed as fusion proteins with a protein tag such as glutathione-S- 
transferase (GST). These GST-fusion proteins are typically immobilised to a solid phase 
such as glutathione sepharose beads or a BlAcore sensor chip. Binding of antibodies, 
10 such as monoclonal antibodies, to the fusion proteins may be determined using standard 
techniques such as Western blotting and/or by labelling the antibodies with a radioactive 
label such as l25 L The use of BlAcore technology allows easy quantitation of the results. 

Preferably, the reduction in binding of at least one of the blocking antibodies tested is at 
15 least 50% compared to wild-type MSP-1, more preferably at least 75, 80 or 90%, typically 
as assessed using recombinantly expressed MSP- 1 immobilised to a BlAcore sensor chip. 
By contrast, the binding of at least one, for example at least two or three, of the 
neutralising antibodies tested, more preferably at least half of the neutralising antibodies 
tested, more preferably substantially all of the neutralising antibodies tested is reduced by 
20 less than 50%, more preferably less than 25%. The number of neutralising antibodies that 
need be tested to confirm compliance with the test criteria will not typically exceed from 
three to five different antibodies (three antibodies are used in the Examples). In a 
particularly preferred embodiment the binding of at least one neutralising antibody is 
increased by at least 1 0%. 

25 

The results given in Table 2 in the Examples provide partial guidance to the skilled person 
as to which residues may be modified to produce a variant MSP-1 of the invention. 
However, the provision herein for the first time of the three dimensional solution structure 
of MSP-1 19 provides the skilled person with further detailed guidance as to which residues 
30 may be altered. In particular, epitopes are expected to be exposed to the aqueous 
environment on the exterior of the MSP-1 19 fragment. Consequently, the precise 
structural information provided which teaches the position of surface exposed amino acids 
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ailows the skilled person to target those residues for modification. This data is given in 
Tables A/B and has also been submitted to the Protein Data Bank (PDB Accession no. 
1CEJ). It enables the skilled person to identify the precise location of individual amino 
acids in the three dimensional structure. Typically, the data is loaded into suitable 
5 software, well-known in the an such as Insight IL MOLSCRIPT GRAS P and RASMOL. 

Further, knowing the location of a modification in the 3 -dimensional structure which 
affects the binding of a blocking antibody without affecting the binding of the neutralising 
antibodies, it is possible to identify other residues that are on the surface and in the 

10 vicinity of the original modification and which may be easily modified to further improve 
the properties of a modified protein. These residues may be in either the first or the second 
EGF-like motifs or in the sequence between them. Since it is known that an antibody 
binding site can encompass a volume that corresponds approximately to the range of 5 to 
8 amino acids, it is clear that modifications of these adjacent residues may also affect the 

1 5 affinity of the protein for the blocking antibodies. Once an adjacent amino acid has been 
identified it can be modified according to the principles outlined above and the 
contribution of the modification to the overall antigenicity and immunogenicity of the 
protein, either alone or in combination with other modifications, can be assessed. Those 
changes that contribute to a reduced affinity for the blocking antibodies, without a 

20 substantial affect on binding of the neutralising antibodies can be incorporated into the 
improved protein. This can be a reiterative process. 

In addition, the 3D NMR structure will enable the skilled person to carry out preliminary 
computer modelling studies of MSP- 1 19 variants with specific modifications so that, for 
25 example variants that cannot fold properly may be discarded. This will assist in 
minimising the number of candidate MSP- 1 19 variants that need be tested. 

Thus the present invention also provides a computer readable medium having stored 
thereon a model of the MSP- 1 19 NMR structure. In a preferred embodiment, said model is 
30 built from all or some of the NMR data shown in Tables A and B. 

Variants of the present invention may optionally include additional MSP- 1 sequences, in 
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particular regions of the MSP-I33 region of MSP-I42 to confer additional immunogenicity 
to the variant. Furthermore, additional sequences known to contain and promote T cell 
responses are advantageously included (i.e. T cell epitopes). Other modifications may 
also be made that increase immunogenicity such as modifications that alter the pathway of 
5 antigen processing and presentation. 

Polypeptide variants of the invention are typically made by recombinant means, for 
example as described below. However they may also be made by synthetic means using 
techniques well known to skilled persons such as solid phase synthesis. Proteins of the 

10 invention may also be produced as fusion proteins, for example to aid in extraction and 
purification. Examples of fusion protein partners include glutathione-S-transferase 
(GST), 6xHis, GAL4 (DNA binding and/or transcriptional activation domains) and 
P-galactosidase. It may also be convenient to include a proteolytic cleavage site between 
the fusion protein partner and the protein sequence of interest to allow removal of fusion 

15 protein sequences. Preferably the fusion protein will not hinder the immunogenicity of the 
MSP-1 variant. 

Polypeptides of the invention may be in a substantially isolated form. It will be 
understood that the polypeptide may be mixed with carriers or diluents which will not 
20 interfere with the intended purpose of the polypeptide and still be regarded as substantially 
isolated. A polypeptide of the invention may also be in a substantially purified form, in 
which case it will generally comprise the polypeptide in a preparation in which more than 
90%, e.g. 95%, 98% or 99% of the polypeptide in the preparation is a polypeptide of the 
invention. 

25 

B. Polynucleotides and vectors 

As discussed above, the variants of the present invention may be produced recombinantly 
using standard techniques. Thus, the present invention also provides a polynucleotide 
30 encoding a polypeptide MSP-1 variant of the invention. Polynucleotides of the invention 
may comprise DNA or RNA. They may .also be polynucleotides which include within 
them synthetic or modified nucleotides. A number of different types of modification to 
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oligonucleotides are known in the an. These include methylphosphonate and 
phosphorothioate backbones, addition of acridine or polylysine chains at the 3' and/or 5' 
ends of the molecule. For the purposes of the present invention, it is to be understood that 
the polynucleotides described herein may be modified by any method available in the an. 
5 Such modifications may be carried out in order to enhance the in vivo activity or life span 
of polynucleotides of the invention. It will be understood by a skilled person that 
numerous different polynucleotides can encode the same polypeptide as a result of the 
degeneracy of the genetic code. 

10 Polynucleotides of the invention comprise can be incorporated into a recombinant 
replicable vector. The vector may be used to replicate the nucleic acid in a compatible 
host cell. Thus in a further embodiment, the invention provides a method of making 
polynucleotides of the invention by introducing a polynucleotide of the invention into a 
replicable vector, introducing the vector into a compatible host cell, and growing the host 

15 cell under conditions which bring about replication of the vector. The vector may be 
recovered from the host cell. Suitable host cells include bacteria such as E. coli, yeast, 
mammalian cell lines and other eukaryotic cell lines, for example insect Sf9 cells. The 
host cell may be a methylotrophic yeast such as Pichia pastoris. 

20 The coding sequence of natural or variant MSP polypeptides (including the polypeptide of 
the invention) may be modified for optimal expression in a host cell. For example, 
secondary modification such as N-glycosylation may be prevented by removal of 
sequences necessary for such modification. The sequence of the polypeptide may 
alternatively or in addition be modified with respect to codon usage for optimal expression 

25 in the host cell. Methods of mutagenising a sequence are known in the art; alternatively, 
the modified coding sequence may be generated by means of PCR gene assembly using 
overlapping synthetic oligonucleotides (Stemmer et al., 1995; Withers-Martinez et al„ 
1999). 

30 Preferably, a polynucleotide of the invention in a vector is operably linked to a regulatory 
sequence that is capable of providing fox the expression of the coding sequence by the 
host cell, i.e. the vector is an expression vector. The term "operably linked" refers to a 
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juxtaposition wherein the components described are in a relationship permitting them to 
function in their intended manner. A regulatory sequence "operably linked" to a coding 
sequence is ligated in such a way that expression of the coding sequence is achieved under 
condition compatible with the control sequences. 

5 

Such vectors may be transformed or transfected into a suitable host cell using standard 
techniques above to provide for expression of a polypeptide of the invention. This 
process may comprise culturing a host cell transformed with an expression vector as 
described above under conditions to provide for expression by the vector of a coding 
10 sequence encoding the polypeptides, and optionally recovering the expressed 
polypeptides. 

The vectors may be for example, piasmid or virus vectors provided with an origin of 
replication, optionally a promoter for the expression of the said polynucleotide and 
15 optionally a regulator of the promoter. The vectors may contain one or more selectable 
marker genes, for example an ampicillin resistance gene in the case of a bacterial piasmid 
- or a neomycin resistance gene for a mammalian vector. Vectors may be used in vitro, for 
example for the production of RNA or used to transfect or transform a host cell. The 
vector may also be adapted to be used in vivo, for example in a method of gene therapy. 

20 

Promoters/enhancers and other expression regulation signals may be selected to be 
compatible with the host cell for which the expression vector is designed. For example, 
prokaryotic promoters may be used, in particular those suitable for use in E. coli strains 
(such as E. coli HB101 or DH5a). 

25 

When expression of the polypeptides of the invention in carried out in mammalian cells, 
either in vitro or in vivo, mammalian promoters may be used. Tissue-specific promoters 
may also be used. Viral promoters may also be used, for example the Moloney murine 
leukaemia virus long terminal repeat (MMLV LTR), the promoter rous sarcoma virus 
30 (RSV) LTR promoter, the SV40 promoter, the human cytomegalovirus (CMV) IE 
promoter, herpes simplex virus promoters or adenovirus promoters. All these promoters 
are readily available in the art. 
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C. Administration 

The variant MSP-1 polypeptides of the present invention and nucleic acid molecules may 
5 be used to treat or prevent malaria in animals, specifically humans. 

The polypeptides of the invention may be administered by direct injection. Preferably the 
polypeptides are combined with a pharmaceutical ly acceptable carrier or diluent to 
produce a pharmaceutical composition. Suitable carriers and diluents include isotonic 

10 saline solutions, for example phosphate-buffered saline. The composition may be 
formulated for parenteral, intramuscular, intravenous, subcutaneous, intraocular or 
transdermal administration. Typically, each polypeptide is administered at a dose of from 
0.01 to 30 fig/kg body weight, preferably from 0.1 to 10 |ag/kg, more preferably from 0.1 
to 1 ^tg/kg body weight. It is also possible to use antibodies prepared using the 

15 polypeptides of the invention, as described below, in treating or preventing Plasmodium 
infection. Neutralising antibodies, or fragments thereof which retain specificity for 
Plasmodium antigens, can be administered in a similar manner to the polypeptides of the 
invention. 

20 The polynucleotides of the invention may be administered directly as a naked nucleic acid 
construct. When the expression cassette is administered as a naked nucleic acid, the 
amount of nucleic acid administered is typically in the range of from 1 \ig to 10 mg, 
preferably from 100 jag to 1 mg. 

25 Uptake of naked nucleic acid constructs by mammalian cells is enhanced by several 
known transfection techniques for example those including the use of transfection agents. 
Example of these agents include cationic agents (for example calcium phosphate and 
DEAE-dextran) and lipofectants (for example lipofectam™ and transfectam™). 
Typically, nucleic acid constructs are mixed with the transfection agent to produce a 

30 composition. 
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Alternatively. the polynucleotide may be administered as pan of a nucleic acid vector, 
including a plasmid vector or viral vector, such as a vaccinia virus vector. When the 
polynucleotide of the invention is delivered to cells by a viral vector of the invention, the 
amount of virus administered is in the range of from 10 J to 10 l0 pfii, preferably from 10 3 
5 to 10 s pfu. more preferably from 10 6 to 10 7 pfu. When injected, typically 1-10 ^1 of virus 
in a pharmaceutically acceptable suitable carrier or diluent is administered. 

Preferably the delivery vehicle (i.e. naked nucleic acid construct or viral vector comprising 
the polynucleotide for example) is combined with a pharmaceutically acceptable carrier or 
10 diluent to produce a pharmaceutical composition. Suitable carriers and diluents include 
isotonic saline solutions, for example phosphate-buffered saline. The composition may be 
formulated for parenteral, intramuscular, intravenous, subcutaneous, intraocular or 
transdermal administration. 

15 The routes of administration and dosages described are intended only as a guide since a 
skilled practitioner will be able to determine readily the optimum route of administration 
and dosage for any particular patient and condition. 

D. Preparation of Vaccines 

20 

Vaccines may be prepared from one or more polypeptides of the invention. They may 
also include one or more immunogenic Plasmodium polypeptides known in the art. Thus 
a vaccine of the invention may comprise one or more polypeptides of the invention and 
optionally, one or more polypeptides selected from, for example, the asexual blood stage 

25 proteins: apical merozoite antigen- 1, erythrocyte binding antigen 175, erythrocyte 
membrane protein- 1; the hepatic stage proteins: liver stage antigens 1 and 3; the 
sporozoite stage proteins: circumsporozoite protein , thrombospondin related adhesive 
protein; and the sexual stage proteins Pfs25 and Pfs28 polypeptides and immunogenic 
fragments thereof. Preferably, the other immunogenic Plasmodium polypeptides known 

30 in the art do not contain wild type MSP-1 19 sequences. 

The preparation of vaccines which contain an immunogenic polypeptide(s) as active 
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ingredient(s), is known to one skilled in the an. Typically, such vaccines are prepared as 
injectables. either as liquid solutions or suspensions: solid forms suitable for solution in. 
or suspension in. liquid prior to injection may also be prepared. The preparation may also 
be emulsified, or the protein encapsulated in liposomes. The active immunogenic 
5 ingredients are often mixed with excipients which are pharmaceutical^ acceptable and 
compatible with the active ingredient. Suitable excipients are, for example, water, saline, 
dextrose, glycerol, ethanol, or the like and combinations thereof. In addition, if desired, 
the vaccine may contain minor amounts of auxiliary substances such as wetting or 
emulsifying agents, pH buffering agents, and/or adjuvants which enhance the 

10 effectiveness of the vaccine. Examples of adjuvants which may be effective include but 
are not limited to: aluminum hydroxide, N-acetyl-muramyl-L-threonyl-D-isoglutamine 
(thr-MDP), N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as 
nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminy l-L-alanine-2-( 1 -2*-dipalmitoyl-sn- 
glycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred to as MTP-PE), 

15 and RIBI, which contains three components extracted from bacteria, monophosphoryl 
lipid A, trehalose dimycolate and cell wall skeleton (MPL+TDM+CWS) in a 2% 
squalene/Tween 80 emulsion. The effectiveness of an adjuvant may be determined by 
measuring the amount of antibodies directed against an immunogenic polypeptide 
containing an MSP-1 antigenic sequence resulting from administration of this polypeptide 

20 in vaccines which are also comprised of the various adjuvants. 

The vaccines are conventionally administered parenterally, by injection, for example, 
either subcutaneously or intramuscularly. Additional formulations which are suitable for 
other modes of administration include suppositories and, in some cases, oral formulations. 

25 For suppositories, traditional binders and carriers may include, for example, polyalkylene 
glycols or triglycerides; such suppositories may be formed from mixtures containing the 
active ingredient in the range of 0.5% to 10%, preferably 1% to 2%. Oral formulations 
include such normally employed excipients as, for example, pharmaceutical grades of 
mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium 

30 carbonate, and the like. These compositions take the form of solutions, suspensions, 
tablets, pills, capsules, sustained release formulations or powders and contain 10% to 95% 
of active ingredient, preferably 25% to 70%. Where the vaccine composition is 
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lyophilised, the iyophilised material may be reconstituted prior to administration, e.g. as a 
suspension. Reconstitution is preferably effected in* buffer. 

Capsules, tablets and pills for oral administration to a patient may be provided with an 
5 enteric coating comprising, for example, Eudragit "S", Eudragit "L". cellulose acetate, 
cellulose acetate phthalate or hydroxypropylmethyl cellulose. 

The polypeptides of the invention may be formulated into the vaccine as neutral or salt 
forms. Pharmaceutically acceptable salts include the acid addition salts (formed with free 

10 amino groups of the peptide) and which are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such organic acids such as acetic, oxalic, 
tartaric and maleic. Salts formed with the free carboxyl groups may also be derived from 
inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric 
hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino 

15 ethanol, histidine and procaine. 

E. Dosage and Administration of Vaccines 

The vaccines are administered in a manner compatible with the dosage formulation, and in 
20 such amount as will be prophylactically and/or therapeutically effective. The quantity to 
be administered, which is generally in the range of 5 |ag to 250 fag of antigen per dose, 
depends on the subject to be treated, capacity of the subject's immune system to synthesise 
antibodies, and the degree of protection desired. Precise amounts of active ingredient 
required to be administered may depend on the judgement of the practitioner and may be 
25 peculiar to each subject. 

The vaccine may be given in a single dose schedule, or preferably in a multiple dose 
schedule. A multiple dose schedule is one in which a primary course of vaccination may 
be with 1-10 separate doses, followed by other doses given at subsequent time intervals 
30 required to maintain and or reinforce the immune response, for example, at 1 to 4 months 
for a second dose, and if needed, a subsequent dose(s) after several months. The dosage 
regimen will also, at least in pan, be determined by the need of the individual and be 
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dependent upon the judgement of the practitioner. 

In addition, the vaccine containing the immunogenic MSP-1 antigen(s) may be 
administered in conjunction with other immunoregulatory agents, for example. 
5 immunoglobulins. 

F. Preparation of antibodies against the polypeptides of the invention 

The variant MSP-1 polypeptides prepared as described above can be used to produce 
10 antibodies, both polyclonal and monoclonal. If polyclonal antibodies are desired, a 
selected mammal (e.g., mouse, rabbit, goat, horse, etc.) is immunised with an 
immunogenic polypeptide bearing an MSP-1 epitope(s). Serum from the immunised 
animal is collected and treated according to known procedures. If serum containing 
polyclonal antibodies to an MSP-1 epitope contains antibodies to other antigens, the 
15 polyclonal antibodies can be purified by immunoaffinity chromatography. Techniques for 
producing and processing polyclonal antisera are known in the art. 

Monoclonal antibodies directed against MSP-1 epitopes in the polypeptides of the 
invention can also be readily produced by one skilled in the an. The general methodology 
20 for making monoclonal antibodies by hybridomas is well known. Immortal antibody- 
producing cell lines can be created by cell fusion, and also by other techniques such as 
direct transformation of B lymphocytes with oncogenic DNA, or transfection with 
Epstein-Barr virus. Panels of monoclonal antibodies produced against MSP-1 epitopes 
can be screened for various properties; i.e., for isotype and epitope affinity. 

25 

The polypeptides of the invention can also be used to select for human monoclonal 
antibodies using the variable regions of immunoglobulin heavy and light chains cloned in 
the form of a phage display library, preferably from individuals who have been previously 
exposed to a natural malaria infection. 

30 

Antibodies, both monoclonal and polyclonal, which are directed against MSP-1 epitopes 
are particularly useful in diagnosis, and those which are neutralising are useful in passive 
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immunotherapy. Monoclonal antibodies, in particular, may be used to raise anti-idiotype 
antibodies. Anti-idiotype antibodies are immunoglobulins which carry an "internal 
image*' of the antigen of the infectious agent against which protection is desired. 

5 Techniques for raising anti-idiotype antibodies are known in the art. These anti-idiotype 
antibodies may also be useful for treatment of Plasmodium infections, as well as for an 
elucidation of the immunogenic regions of MSP-1 antigens. It is also possible to use 
fragments of the antibodies described above, for example, F(ab") 2 , Fab, Facb and scFv 
fragments. 

10 

It should be appeciated that features from various sections, aspects and embodiments of 
the invention as described above are generally equally applicable to other sections, aspects 
and embodiments mutatis mutandis. 

15 The invention will now be further described by way of Examples, which are meant to 
serve to assist one of ordinary skill in the art in carrying out the invention and are not 
intended in any way to limit the scope of the invention. The Examples refer to the 
Figures. In the Figures: 

20 Detailed Description of the Figures 

Figure 1 - MSP-1 sequences aligned according to the EGF-like motif consensus. Top 
sequence: P. falciparum (SWISS-PROT MSP1 PLAFW). Second sequence: P. vivax 
Belem strain (PIR A45604). Third sequence: human EGF (PDB legf). Fourth sequence: 
25 EGF-like domain consensus (Prosite EGF1). Bottom sequence: 14 residue EGF core 
region used for structure alignment in Figure 6. Black highlighting indicates conserved 
residues of the EGF-like domain. Dark shading shows hydrophobic residues at the EGF- 
module pair interface in the P. falciparum, and corresponding conserved residues in the 
P. vivax sequence. 

30 

Figure 2 - Sample of multidimensional heteronuclear NOESY experiments showing 
planes containing NOE connections to the MSP-1 C-terminal fragment Lys35 NH proton. 
Top: I3 C (D4) and *H(D3) plane from the 4D-[ l3 C]-HMQC-NOESY-[ ,5 N]-HSQC 
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experimem. taken at the chemical shift values of Lys35 NH in l *N(D2) and 'HfDl). 
Bonom: strip from the 3D [ l5 N]-NOESY-HSQC at the l H chemical shift value of Lys35 
NH (vertical axis, Dl) taken at the plane of its l "N (D3) value. The horizontal 'H axis is 
aligned with that of the top spectrum. The weak cross-peaks at 2.72 and 3.01 ppm in the 
5 3D spectrum do not show corresponding cross-peaks in the 4D spectrum because of the 
lower signal-to-noise ratio in the latter. These peaks have been assigned as the cross-peaks 
between Lys35 NH and Asn44 Hp 2 (2.72 ppm), and Cys30 Hps and/or Cys41 H32 (3.01 
ppm). 

10 Figure 3 - Stereo drawing showing the backbone C, N, C a atoms of the 32 refined 
structures in the final ensemble. The domain- 1 is on the left (red), with domain-2 on the 
right (green), and both the N- and C-termini are near the bottom. 

Figure 4 - MOLSCRIPT picture of the most representative model of the ensemble, 
15 showing the backbone C a trace, antiparallel 0-sheet elements, and disulphide bridges (S Y 
atoms in yellow). Domain- 1, red; Domain-2, green. 

Figure 5 - Alignment of typical EGF-like family members with the fitpdb program, using 
the 14 amino acid "reduced core" consensus (Bersch et a/., 1998) (see Figure 1). The 

20 aligned backbone segment in each structure is white. The structures are aligned relative to 
the most representative structure of the group (factor Xa), with increasing divergence from 
left to right. Numbers indicate the imsd value of the aligned C, N, C a atoms. PDB 
identification codes: factor Xa (crystal structure), lhcg; Complement Clr component, 
lapq (14 th model); human EGF, legf (11 th model); fibrillin-1, domains-32 and -33_lemn 

25 (minimized average structure); transforming growth factor-a r 2tgf (minimized average 
structure); MSP-1 domains-1 and -2, this study. 

Figure 6 - Backbone ribbon view of fibrillin- 1 versus MSP-1 EGF module pair 
arrangements. Fibrillin-1 (lemn) cyan (domain-32) and magenta (domain-33) (Downing 
30 et a/., 1996); MSP-1 domain- 1 (yellow) and domain-2 (green). Structures were aligned as 
in Figure 6 by the core consensus of the N-terminal domain of each pair. The bound Ca 2 * 
ions in the fibrillin-1 structure are shown as magenta spheres. 
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Figure 7 - Two views, a and b, (rotated 180° aboutthe y-axis) of the electrostatic potential 
surface of the MSP-1 EGF module pair, calculated with GRASP. Red indicates negative 
charge, blue indicates positive charge, and white is neutral. The orientation of the views 
5 is shown by the adjacent worm diagrams. 

Figure 8 - CPK model of the MSP-1 C-terminal fragment, showing the location of some 
mutations that affect binding of monoclonal antibodies. Domain- 1 is towards the top and 
right sides, and domain-2 towards the bottom left. 

10 

Figure 9 - Examples of the binding of monoclonal antibodies to GST-MSP-1 19 detected by 
Western blotting. The binding of each monoclonal antibody to protein based on the wild 
type sequence and to proteins containing modified sequences is shown. The monoclonal 
antibodies are shown across the top. On the left is shown the proteins: WT, wild type 
15 : sequence; 22, Leu22 to Arg; 26, Glu26 to lie; 15, AsnlS to Arg; 27, Glu27 to Tyr; 31, 
Leu3 1 to Arg; 43, GIu43 to Leu; 27+3 1 +43, Glu27 to Tyr and Leu3 1 to Arg and Glu43 to 
Leu; 1 5+27+3 1 +43, Asnl 5 to Arg and Glu27 to Tyr and Leu3 1 to Arg and Glu43 to Leu. 

Figure 10 - The binding of monoclonal antibodies to GST-MSP-1 19 detected by BIAcore 
20 analysis. The binding of each monoclonal antibody is normalised to 100% binding to 
protein based on the wild type sequence and the binding of proteins containing modified 
sequences is expressed as a percentage of this. WT, wild type sequence; 15, Asn 15— * Arg; 
26, Glu26^Ile; 27, Glu27-+Tyr; 31, Leu31->Arg; 34, Tyr34->Ser; 43 Glu43^Leu. 

25 Figure 11 - The binding of monoclonal antibodies to GST-MSP-1 19 containing multiple 
modifications detected by BIAcore analysis. The binding of each monoclonal antibody is 
normalised to 100% binding to protein based on the wild type sequence and the binding of 
proteins containing modified sequences is expressed as a percentage of this. WT, wild 
type sequence; The combinations contain 3 mutations [27+31+43], or 4 mutations 

30 ([27+31+34+43] and [15+27+31+43]), at each site the changes are those identified in 
Figure 10. 
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Figure 12 - Identification of blocking antibodies using a competitive binding assay and 
immobilised wild type GST-MSP-1 19 The ability of antibodies to compete with the 
binding of mAbs 12.8 and 12.10 to GST-MSP-1 19 was measured using BIAcore analysis. 
Individual antibodies (x-axis ) were bound to the antigen and then the amount of either 
5 12.8 or 12.10 (inhibitory mAb) that could subsequently bind was quantified. The amount 
of binding is presented as a percentage of the total amount of either 12.8 or 12.10 bound in 
the absence of pre-incubation with another antibody. 

Figure 13 - Antibodies induced by immunisation with a modified recombinant MSP- 1 19 
10 assayed for their ability to inhibit secondary processing. Washed 3D7 merozoites were 
either analysed directly without incubation (0 h) or incubated for 1 hour at 37°C in the 
presence of no serum (no serum), i mM PMSF as a control for complete inhibition, 
normal rabbit sera (normal serum), or serum from a rabbit immunised with the 
15+27+31+43 modified protein (immune serum), all at 1:10 dilution in reaction buffer. 
15 The level of MSP- 1 33 released into the supernatant as a results of secondary processing 
was measured using an ELISA method and is represented by Absorbance at 492nm. 

Figure 14. Pichia pastoris codon preference table used for input to the CODOP program. 

20 Figure 15. DNA and protein sequences for the optimized synthetic MSP-142 gene. A: 
Complete sequence designed for optimum codon usage and expression in P. pastoris. B: 
Sequence of the synthetic MSP-119 construct in the expression vector pPIC9K-HXa. 
Uppercase letters: vector sequences, including the His6 tag and factor Xa cleavage site 
(IEGR). Lowercase letters: synthetic MSP-1 19 coding sequence. The cloned sequence in 

25 located at the SnaBI restriction site of the pPIC9K sequence. C: Expressed protein 
sequence of the synthetic MSP-119 construct. The sequence shown is produced as a 
fusion to the pPIC9K a-factor secretion signal, following the kex2/STE13 processing 
sites. The synthetic MSP-119 is in bold-face type. D: Sequence of the MSP-1 33 construct. 
The cloned sequence is located at the Smal site of the pUC118 vector. E: Predicted 

30 protein sequence of the synthetic MSP- 133 construct translation product. 

Figure 16. Gene assembly PGR reactions for the MSP-133 and MSP-119 sequences. 
Reaction 1: 10 j^L aliquots of the assembly reactions. Reaction 2: 20 \iL aliquots of the 
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amplification reactions. The N-terminal and middle fragments were subsequently spliced 
together to form the MSP- 133 synthetic construct. The C-terminal fragment synthesis 
reactions produced the optimized MSP-1 19 construct. 

5 Figure 17. Expression of the synthetic MSP-1 19 protein in P. pastoris. Lanes 1-6: 
trichloroacetic acid precipitates of secreted recombinant protein from culture supernatants, 
without further purification (5 \iL each). Samples from duplicate cultures of three 
independent transformants. Lane 8,9: purified, deglycosylated MSP-1 19 produced from 
the original P. falciparum sequence. Lane 7,10: NOVEX molecular weight markers. 

10 

Figure 18. A: { ! H/ I5 N}-HSQC spectrum of the protein (2.5 mM) expressed from the 
optimized synthetic MSP-1 19 gene. B: Control { l H/ i5 N}-HSQC of deglycosylated protein 
(2.2 mM) expressed from the original P. falciparum sequence (Morgan et al., 1999). 

15 Examples 

Materials and Methods 

Protein expression and stable-isotope labelling for NMR 

20 

The coding sequence of the MSP-1 C-terminal fragment was cloned by polymerase chain 
reaction with Vent polymerase (New England Biolabs) from a plasmid containing the 
Plasmodium falciparum strain T9/94 fragment (Blackman et al., 1991), using primers that 
included codons for a 6 residue N-terminal His tag (CACCATCATCATCATCAC), and 

25 inserted into the SnaBl restriction site of the pPIC9K vector (Invitrogen). The sequence 
corresponds to residues 1526-1621 of the SWISS-PROT entry MSP1 PLAFW (accession 
number P04933). This produced an a-factor fusion protein with the sequence 
. . . KR/E A/E A/YHHHHHHNISQ. . . . S S SN . where the slashes indicate kex2 and STE13 
processing sites. High copy number transformants of the methylotrophic yeast 

30 Komagataella (Pichia) pasioris protease-deficient strain SMD1168 (his4 pep4) were 
isolated by screening for high G41 8 resistance (Clare et al., 1995). 

A Mut~ transformant was grown at 29.4 °C in a shaker-incubator in buffered minimal 
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medium (100 mM potassium phosphate, pH 6.0 ? yeast nitrogen base (0.34 % w/vol) 
(DIFCO: YNB without amino acids and without (NH.O2SO4), biotin (4x10°% w/vol). 
Sigma antifoam 289 (0.01% vol/vol), and carbon and nitrogen sources as described below. 
Unlabelled samples were initially grown in medium containing 1 % w/vol (NH.O2SO4 and 
5 1 % w/vol glycerol, and induced by transfer to medium containing 0.5 % CH3OH as the 
carbon source. Labelled samples were initially grown in medium containing 0.2 % w/vol 
[ 15 N]-(NH4)2S0 4 (Isotech), and 0.5 % w/vol glucose or [ 13 C 6 ]-glucose (Isotech), and 
induced by transfer to medium containing as carbon source 0.5 % w/vol CH3OH or [ I3 C]- 
CH3OH (Isotech). The initial cultures were grown in 150 ml to a density of -10 ODsoo, 

10 then harvested and resuspended in methanol medium at 1 OD600 in a volume of 1.5 L. 
Methanol-induced cultures were grown for 4 d ? with daily addition of 7.5 ml CH3OH or 
[ 13 C]-CH 5 OH, to a final density of -18 ODeoo- This protocol produced a maximum yield 
of 24 mg/L of purified, 13 C/ I5 N uniformly labelled protein at the final stage (see below). 
The YNB-based medium produced about 3 -fold higher yields than the FM22 medium 

15 (Laroche et al., 1 994), for stable-isotope labeling of the MSP-1 C-terminal fragment. 

Cells were removed by low-speed centrifugation, protease inhibitors added 
(COMPLETE™ tablets, Boehringer-Mannheim: 1 tablet/500 ml supernatant), and the 
supernatant was filter-sterilized. The supernatant was concentrated -20-fold by 

20 ultrafiltration in a stirred cell (Amicon, YM3 membrane) at 4 °C. The pH was adjusted to 
7.25 with KOH, and the partially N-glycosylated MSP-1 fragment was deglycosylated for 
72 h at 37 °C with 5000 U PNGaseF (New England Biolabs). The carbohydrate was 
completely removed (as shown by electrophoresis and mass spectrometry), with the Asn 1 
residue presumably converted to Asp in the process. The supernatant was clarified by low- 

25 speed centrifugation, 5 M NaCl was added to a final concentration of 0.3 M, and the 
sample was applied to a 2 ml Ni-NTA affinity column (QIAGEN), washed, and eluted 
with 250 mM imidazole according to manufacturer's instructions. The eluate was 
dialyzed against 50 mM sodium phosphate (pH 6.5). 50 mM NaCL and then passed 
through a 1 ml Hi-Trap Q anion exchange resin (Pharmacia) to remove misfolded MSP-1 

30 that bound to the column. The MSP-1 fragment was characterized by Western blotting and 
electrospray mass spectrometry (data not shown). Two principal species of mass 11607 
and 11807 Da were observed corresponding to the expected fragment, as well as a 
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fragment with an additional N-terminal Glu - Ala dipeptide resulting from incomplete 
STE13 processing of the a-factor secretion signal. * 

Samples for NMR experiments were prepared in either 90 % H 2 O/10 % D 2 0 with 0.01 % 
5 w/vol NaN 3 . or 100 % D 2 0, 50 mM sodium phosphate, 100 mM NaCl at pH 6.5 ? (pH 
uncorrected for deuterium isotope effects), at a concentration of 2.1 to 2.6 mM in 0.6 ml. 
Protein concentration was measured by UV absorbance at 280 nm, using a calculated 
molar extinction coefficient of 5220 liter moP 1 cm" 1 . The protein was demonstrated to be 
monomelic by equilibrium uitracentrifugation of a 0.12 mM sample in the above buffer at 
10 293 K. 

NMR experiments and data processing 

Most of the experiments were performed at 298 K, using Varian Unity and Unity-Plus 
15 spectrometers operating at 600 MHz and 500 MHz respectively. Details of the 
multidimensional experiments (Clore & Gronenborn, 1998) and acquisition parameters 
~ used for resonance assignments and structure determination are given in Tables A/B and 
have been submitted to the Protein Data Bank database (PDB Accession No 1CEJ). 

All spectra were processed using Felix 95.0 or 97.0 (Biosym/MSI) using a 90 degree- or 
72 degree-shifted sinebell-squared window function. Dimensions, zero-filling, and linear 
prediction details are summarized in Tables A/B and in the submission to the 
BioMagResBank. Four dimensional and interleaved spectra were processed in Felix using 
macros written in-house. 

Signal assignments: Sequential assignments were made based on connectivities 
established primarily by CBCA(CO)NH and CBCANH experiments on uniformly l3 C/ 1:) N 
labelled protein. Side chain-spin system assignments were made on the basis of data from 
l3 C/'H -HCCH-TOCSY experiment correlated with information from 15 N/'H-TOCSY- 
HSQC and ^N/'H-NOESY-HSQC. and HNHA and HNHB experiments. Assignments 
were obtained for l H ? I5 N and aliphatic-- 1 3 C signals for 98% of side-chains and 96% of 
backbone amide groups. The list of assignments is given in Tables A/B and in the 
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submission to the Protein Data Bank database (PDB Accession No 1CEJ). The l? N{*H} 
heteronuclear NOE experiment was carried out as described previously (Kay et ai t 1989; 
Polshakovef a/.. 1997). 

5 Distance Restraints: NOE- and ROE-derived distance restraints between backbone and 
side chain amide protons were obtained primarily from the 3D '"N-NOESY-HSQC, 
l5 N-ROESY-HSQC, and 4D l3 C-HMQC-NOESY- l5 N-HSQC experiments. Aliphatic to 
aliphatic proton distance restraints were obtained from a 4D l3 C-HMQC-NOESY- i3 C- 
HSQC experiment. A 3D 13 C-HMQC-NOESY experiment in D 2 0 was used to identify 

10 aliphatic to aromatic proton NOEs and 2D NOESY experiments were used to measure 
aromatic to aromatic proton NOEs. Crosspeaks were quantified by volume integration in 
Felix for 2D and 3D experiments and for the 4D 13 C-HMQC-NOESY- l5 N-HSQC 
experiment, and from peak height measurements in the 4D l3 C-HMQC-NOESY- 13 C- 
HMQC spectra. Crosspeaks were classified as strong, medium and weak and these were 

15 assigned to distance restraints of 0 - 2.8, 0 - 3.6, and 0 - 5.5 A. Restraints from backbone 
amide signals were initially treated in this manner, and then recalibrated more precisely 
using 3D- l3 N-ROESY-HSQC data into four classes involving maximum distances of 2.6, 
3.1, 3.6, and 4.1 A. Restraints to groups of equivalent or non-stereoassigned protons were 
treated by r" 6 summation. Most intraresidue distances (HN-H3 and H a -Hp) were converted 

20 to xi angle restraints as described below and these distance restraints were not included in 
the final list. 

Dihedral Angle Restraints: %\ angles and stereospecific assignments of (3-methylene 
protons were obtained using the grid-search program AngleSearch, with coupling constant 

25 and intraresidue ROE distance information (Polshakov et aL, 1995). The coupling 
constant information was provided by HNHB and HN(CO)HB spectral intensities for 
J J(HN-H P ) and J J(CO-H|j), and intraresidue distances (HN-Hp, H a -H p ) were obtained from 
3D l5 N-ROESY-HSQC and 2D ROESY (D 2 0) experiments. 3 J(HN-H a ) coupling 
constants were obtained from the HNHA experiment. Residues with positive <J> angles (ca. 

30 -60 degrees) were identified by large intraresidue H a crosspeak intensities in the 
HN(CO)HB experiment, and y angles near -60° degrees from strong H a <i-n crosspeaks in 
the HNHB experiment. He and Leu X2 angles and Leu 5 stereoassignments were derived 
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from the LRCH experiment. Minimum ranges of 40 degrees (xi. 7jl) and 50 degrees (4>. 
were used to account for errors and local dynamic effects on the coupling constants. 

Bisulphide Bonding Pattern: An initial set of 20 structures was calculated by simulated 
5 annealing using approximately 550 unambiguous NOE-derived distance restraints and 36 
Xi and <J> dihedral angle restraints but with no hydrogen bonding or disulphide bond 
constraints. The Cys - Cys S r distances in these structures were examined in order to 
establish the probable bonding pattern. Prior to the calculations, the formation of 
disulphide bridges for 4 Cys residues (Cys 12 - Cys28, Cys78 - Cys92) was already 

10 established with high probability by the observation of Hp-Hp NOEs between these pairs 
of Cys residues. Examination of the initial structures confirmed these disulphide bridges 
and also indicated a disulphide bridge between residues Cys30 and Cys41. The third 
disulphide bridge in domain- 1 (Cys7 - Cys 18) could thus be assigned by default although 
the structure of the N-terminus was not well-defined by the NMR data. The best six 

15 structures in terms of total X-PLOR energy and violations indicated that the average Cys - 
Cys S r distance was lowest for the disulphide bonding pattern [1-3, 2-4, 5-6] in each 
domain, and only this combination allowed all Cys residues to form contacts with a 
partner <3.5 A away. Thus, this disulphide bonding pattern was most consistent with the 
experimental data for both domains, and was imposed (initially as NOE-style distance 

20 restraints) in subsequent calculations. The [1-3, 2-4, 5-6] pattern is that expected for an 
EGF-like domain. 

Hydrogen Bonds: Non-exchanging amide groups involved in stable hydrogen bonds were 
identified in spectra of samples examined in 100 % D2O. The corresponding hydrogen 

25 bond acceptors were determined by examining the initial structural ensemble, using the 
Insight II and HBPlus (McDonald et aL, 1994) programs, and hydrogen bond distance 
restraints were included in subsequent calculations. Further hydrogen bonds were 
identified in a similar manner in iterative calculations. Only 10 backbone hydrogen bonds 
in the antiparallel {3 sheets were used as restraints. Two distance restraints were used for 

30 each hydrogen bond, 1.7 - 2.3 A from proton to acceptor, and 3.0 - 3.6 A from donor 
nitrogen atom to acceptor. 
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Structure Calculations 

All the structure calculations were performed following standard protocols for ab initio 
simulated annealing from an extended chain using X-PLOR version 3.843 on a Silicon 
5 Graphics Origin 200 computer. The initial calculations used an initial temperature of 
1000K, and 9000 steps of 5 fs in the restrained molecular dynamics stage. A soft-square 
potential was used for distance restraints. The SHAKE (Ryckaert et al. 9 1977) algorithm 
was employed during molecular dynamics to maintain correct bond lengths. Refinement 
used a square well potential for restraints, and a final slow cooling of 30000 steps of 4 fs 

10 each from 2000K. A modified "parallhdg.pro" force-field parameter set was used, with 
modifications to parameters for Arg and Pro residues, and for hydrogen bonds (Polshakov 
et aL, 1997). Force constants were 50 kcal mol" 1 A' 2 for all distance restraints including 
hydrogen bonds, and 200 kcal mol* 1 rad" 2 for dihedral restraints. The N-terminal sequence 
including the vector-encoded residues and (His)6 tag was excluded from the structure 

15 calculations. All peptide bonds were constrained to be trans. NOE data for all 5 Pro 
residues showed strong H a (i-i)-ProH a crosspeaks, consistent with the trans peptide 
conformation. 

Initial structures were calculated as described above to determine the disulphide bonding 
20 pattern. Then the calculation was repeated with identical NOE-derived distance and 
dihedral angle restraints, with the addition of 6 distance restraints (1.92-3.12 A) 
representing the disulphide bridges. A new set of 50 structures was obtained, from which 
the best 20 structures were selected. The criteria used for selection were that the 
structures were below the median value of both total X-PLOR energy and rms NOE 
25 difference, and had no dihedral angle violations. The resulting structures had good 
geometry and between zero and two NOE violations > 0.5 A. These structures were used 
to assign previously ambiguous NOEs and to determine the hydrogen bonds as described 
above. 

30 The final structure calculation and refinement used an expanded restraint list including 
hydrogen bonds, additional dihedral restraints, stereoassignments of P -methylene and Leu 
6 signals, and more precisely calibrated ROE data (see Table 1). A set of 100 structures 
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was obtained using this list, and 38 structures with 0-2 NOE violations > 0.5 A and no 
dihedral angle violations > 5° were accepted. These 38 structures were refined by the 
slow-cooling procedure described above, producing a final ensemble of 32 accepted 
structures with no NOE violations > 0.5 A and no dihedral angle violations > 5°. These 
5 selection criteria produced an ensemble of structures that extend to the end of the 
continuum of total potential energies in order to include structures having large scale 
correlated motions (Abseher et aL, 1998). Statistics for the final ensemble are given in 
Table 1. Coordinates for the 32 refined structures have been deposited in the Brookhaven 
Protein Data Bank (coordinates ID code lcej; NMR restraints ID code rlcejmr). 

10 

Structures were analyzed during the calculation process using X-PLOR 3.8 (Nilges et a/., 
1991), PROCHECK-NMR/AQUA (Laskowski et aL, 1996), and Insight II for quality of 
agreement with experimental data, precision, geometry, and energy. Models were aligned 
with Insight II and fitpdb. and displayed with Insight II, MOLSCRIPT (Kraulis, 1991), 
15 and GRASP (Nicholls et a/., 1991). 

Table 1. 

A: RESTRAINTS SUMMARY 
20 Number of conformers calculated: 1 00 Number of conformers accepted: 32 
Acceptance criteria: 

No distance violation: > 0.5 A No dihedral angle violation: > 5 0 

25 NOE/ROE distance restraints: 

Intraresidue: 73 Sequential: 222 

Medium range ( 2-4 ): 90 Long range ( > 4 ): 185 
Total: 570 

30 Dihedral angle restraints: 

phi: 25 psi: 33 chi-l:22-- chi-2: 5 Total: 85 
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Hydrogen bonds: 10 Disulphide bonds: 6 



B: STRUCTURE QUALITY 

average s.d. 

5 Total X-PLOR energy (kcalmol 1 ) 168 20 

NOE X-PLOR energy (kcal moi" 1 ) 2 1 8 



rmsdNOE 0.026 0.005 

rmsd dihedral angle 0.236 0.095 

10 rmsd bond length 0.0029 0.0002 

rmsd bond angle 0.357 0.023 

rmsd improper 0.266 0.018 

Backbone rmsd of structured region: (69 residues) 

15 Overall: 1.05 0.28 

Domain-1: 0.81 0.32 

Domain-2: 0.83 0.35 



Ramachandran plot quality (phi/psi angles): 

20 Most favoured 49.5 % 

Additional allowed 42.1% 
Generously allowed 5.6 % 

Disallowed 2.7 % 



25 Monoclonal antibodies (mAbs) 



Anti-MSP-li9 monoclonal anybodies used in this study were : mouse IgG mAbs 1E1, 
1E8, 2F10, 111.2, 111.4 2.2, 5.2, 7.5, 9C8, 12.8, 12.10, 12D11, 117.2, 8A12 (Holder el 
al, 1985; McBride & Heidrich, 1987; Blackman et aL 1987; Guevara Patiiio et al., 1997); 
30 and mouse IgM mAb 5B1 (Pirson & Perkins, 1985). Of these, mAbs 12.8, 12.10 and 5B1 
are neutralising, inhibitory antibodies and 1E1, 2.2, 7.5, 9C8 and 111.4 are blocking 
antibodies. Some antibodies such as 1 1 1 .2 are neither inhibitory nor blocking. 
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Construction of modified MSP-1 iq clones 

The DNA coding the wild type MSP-1 19 domain of Plasmodium falciparum (T9- 
5 94/Wellcome strain) MSP-1 has been cloned in expression vector pGEX-3X to produce 
MSP-1 19 fused to the carboxy-terminus of the Schistosoma japonicum glutathione S- 
transferase (GST) in Escherichia coli (Burghaus & Holder, 1994). Site-directed 
mutagenesis of MSP-1 19 DNA sequence was done in either of two ways. 

10 The first method was a modification of the method of Perrm & Gilliland (1990) to carry 
out polymerase chain reaction (PCR)-mediated site specific mutagenesis. DNA was 
amplified using the plasmid as a template together with one oligonucleotide to introduce 
the point mutation and a 5 ? primer from outside of the MSP-1 19 sequence. The amplified 
product was purified after electrophoresis on an agarose gel and used in a second 

15 amplification step together with a 3 ? primer from outside of the other end of the MSP-1 19 
sequence and the plasmid as template. This second PCR product was digested with the 
restriction enzymes EcoRl and BamRl and the product consisting of the modified MSP- 
1 19 coding sequence was inserted back into pGEX-3X and the products were used to 
transform DH5a cells. 

20 

The second method used the QuikChange™ Site-directed mutagenesis kit from 
Stratagene. Briefly, using the plasmid pGEX-MSP-l !9 as a template, two complementary 
synthetic oligonucleotide primers containing the desired point mutation were designed and 
were extended on the template by temperature cycling with the enzyme Pfy DNA 

25 Polymerase. This incorporation of the oligonucleotide primers results in the generation of 
a mutated plasmid containing staggered nicks in the DNA sequence. Following the 
temperature cycling, the product was treated with Dpnl endonulease which will digest the 
methylated parental DNA template and leaves the mutation-containing newly synthesised 
DNA intact. The DNA incorporating the desired mutation was then transformed into E. 

30 coli strain DH5ct (Life technologies) competent cells where the nicks will be repaired. 

Clones were screened by analysis of restriction enzyme digests and by PCR screening of 
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the insert gene. The DNA sequence of the selected mutant clones was confirmed using a 
PerkinElmer Applied Biosystems ABI 377 automatic sequencer according to the 
manufacturer's instructions. 

5 Expression of the GST-MSP-l jo fusion proteins 

Expression of GST-MSP-I19 was induced with 1 mM isopropyl-(J-D- 
thiogalactopyranoside (IPTG; Melford Laboratories) for 1 hour in the E. coli strain TOPP 
1 (Stratagene). The cells were then harvested by centrifugation and the cell pellet was 

10 resuspended in cell lysis buffer (50 mM Tris-HCl/1 mM EDTA pH 8.0 containing 0.2% 
(v/v) Nonidet P40 (NP40; BDH). Phenylmethylsulphonyl fluoride (PMSF; Sigma) in 
isopropanol was added to a final concentration of 1 mM. The cell suspension was 
sonicated, on ice, using VibraCell sonicator (Sonics & Materials) at 50% duty cycle for 3 
min (six 30 sec pulses with 30 sec in between). The cell lysate was centrifuged at 65000 x 

15 g for 1 hour at 4°C. Supernatant containing soluble GST- fusion protein was applied to a 
glutathione-agrose column (Sigma) and the GST-fusion protein was eluted with 5 mM 
reduced glutathione. The eluted GST-fusion protein was dialysed extensively against 
phosphate buffered saline (PBS) at 4°C. 

20 SDS-PAGE and Western Blottine 

Proteins were analysed by polyacrylamide gel electrophoresis in the presence of sodium 
dodecyl sulphate (SDS-PAGE). Samples were solubilised in SDS-PAGE buffer without 
reducing agents, then fractionated on a homogeneous 12.5% polyacrylamide gel. The pre- 

25 stained low range molecular mass markers (24-102kDa) from Bio-Rad were used as 
markers. When required, SDS-PAGE-fractionated polypeptides were either stained with 
Coomassie Brilliant Blue R-250 (CBB; Sigma) or electrophoretically transferred to 
Optitran BA-S 83 reinforced nitrocellulose (Schleicher & Schull, 0.2 fim pore size) for 
analysis by western blotting. Blots were blocked with 5% BSA, 0.5% Tween 20 in PBS 

30 (PBS-T) for 1 h at room temperature, then washed in PBS-T. Blots were probed with first 
antibodies for 2 h at room temperature, -washed 3 times in PBS-T, and then incubated in 
1/1000 dilution of horse radish peroxidase (HRP)-conjugated sheep anti-mouse IgG (H+L) 
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(ICN Immunobiologicals) or Goat anti-mouse IgM (\i chain) (Sigma) for 1 h at room 
temperature. Blots were then washed 3 times in FBS-T and developed using Super Signal 
Substrate (Pierce) as HRP substrate for 1 min. Blots were then placed in plastic wrap and 
exposed to X-ray film (XB-200, X-ograph Imaging Systems). The films were processed 
5 with an Agfa Gevamatic60 film processor (Agfa). 

Analysis of antibodv-antigen interaction using a BIAcore machine 

GST-MSP-l 19 containing either the wild type or various modified sequences was used to 
10 coat a carboxymethyl dextran hydrogen sensor chip by the following methodology. The 
binding of the GST-MSP-l 19 was via amino groups using EDC/NHS chemistry. 
Immobilisation was done with the amine coupling kit (Pharmacia BIAcore). The CM 
dextran surface was activated with 50 |il of 200 mM l-ethyl-3-(3-dimethylaminopropyl) 
carbodiimide (EDC) and 5 mM N-hydroxysuccinamide (NHS) for 10 min. GST-MSP-l 19 
15 was then coupled to the BIAcore sensor surface using 50 fil of a solution at 100 jig ml" 1 in 
coating buffer (0.0 1M sodium acetate buffer, pH 3.5) for 10 min. Unreacted carboxyl 
groups were blocked by adding 50 jil 1 M ethanolamine, pH 8.5 for 10 min. The cells 
were washed with two pulses of 20 fil 10 mM glycine-HCl, pH 2.8 for 8 min in total to 
remove any non-covalently bound protein. The immobilisation procedure was carried out 
20 at a flow rate of 5 p.1 min" 1 . Measurements were performed on the BIAcore 2000 
instrument. 

RESULTS 

25 Example 1 - Resonance assignments, NMR restraints and structure determination 

The assignments and restraints were obtained as described in Materials and Methods using 
a range of multidimensional heteronuclear experiments with i3 C/ l5 N uniformly labelled 
protein. Sample spectra from 3D and 4D experiments showing NOE connections to the 
30 Lys35 backbone amide NH proton, resolved and unambiguously assigned using the I3 C 
chemical shift information, are shown "in Figure 2. The distance, dihedral angle and 
hydrogen bond restraints used in the final set of structure calculations are summarized in 
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Table 11 A total of 570 unambiguously assigned distance restraints, 85 dihedral angle 
restraints, and 1 0 hydrogen bonds were used in the final set. The assignments and restraint 
list shown in Table A have been submitted to the BioMagResBank database. Three 
disulphide bonds, with the (l-3 r 2-4, 5-6) partem for each domain were experimentally 

5 determined from the NMR data in preliminary calculations as described in Materials and 
Methods, and these were also included in the final refinement. A final set of 32 models 
was calculated and refined using these restraints and these structures are shown in Figure 
3 superimposed on the backbone of the representative structure S rC p. Table 1 shows that 
all 32 models have good geometry and are in good agreement with the experimental data 

10 with no NOE violations > 0.5 A and no dihedral angle violations > 5°. The atomic rmsd 
value for the backbone atoms of the well-structured region (residues 1 5 - 64, 74 - 92) is 
1 .05 A (see Table 1). The local backbone rmsd is highest at the N-terminus (up to Cysl2), 
in the loop Glu65 - Lys73, and following Cys92 at the C -terminus. The Ramachandran 
plot quality is typical of that found for other EGF structures (Doreleijers et al. y 1998). 

15 

Description of the structure 



EGF-domains 

Analysis of the final ensemble by PROCHECK-NMR indicated that each domain contains 
20 a major stretch of antiparallel fi-sheet containing the third and fourth Cys residues of each 
domain, as expected for an EGF-like fold, as well as an additional minor antiparallel 
p-sheet at the C-terminal end of domain- 1, similar to some (but not all) EGF family 
members. These secondary structure features, together with the disulphide bonding 
patterns, can be seen in Figure 4. There is also a well-defined type II tight turn in domain- 
25 1 , with a hydrogen bond from Tyr 34NH proton to Leu3 1 carbonyl oxygen. The normally 
conserved EGF consensus Gly residue in the tight turn is replaced in domain- 1 by a 
residue with a positive <|> angle (Asn33), while the conserved aromatic residue is present 
(Tyr34). There is a probable hydrogen bond between Leu 31NH proton to Asnl5 carbonyl 
oxygen. Domain-2 contains two turns preceding the major (5-sheet, (Asn53 - Cys56, 
30 Asp57 - Ala60), and a final bend from Leu86 - Phe91 with a probable hydrogen bond 
from Asp57 NH proton to the carbonyl oxygen of Ile90 or Gly89. A surface-exposed loop 
from Pro81 to Pro85 replaces the tight turn, while the aromatic residue is not conserved. 
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The large loop at the end of the major b-sheet (Glu65 - Lys73) is relatively disordered, and 
high mobility for the segment Gly68 - Gly71 was confirmed by backbone amide l3 N{ ! H} 
heteronuclear NOE measurements (Barbato et al. 7 1992). The heteronuclear NOE values 
are dramatically reduced for residues in this region. At the N-terminus: the low NOE 
5 intensities correspond to increased mobility compared with the rest of the protein. The 
interdomain linker region from Pro45 to Pro47 is distinct from other EGF-like module 
pairs. The conformations of the disulphide bridges between Cys30 - Cys41 in domain-1, 
and the three Cys - Cys bonds in domain-2 are all left handed spirals (Richardson, 1981). 
Bridges between Cys30 - Cys41, Cys56 - Cys76, and Cys78 - Cys92 are particularly close 
10 to their equivalents in the blood coagulation factor Xa structure (lhcg). The conformations 
of the first two disulphide bridges in the relatively disordered N-terminal segment of 
domain-1 were not determined. 

Figure 5 shows the backbone C, N, C a atom alignments of the two MSP-1 C-terminal 
15 fragment domains made with typical examples of EGF-like domains from several 
proteins, using the fitpdb program. Pairwise alignments showed that the two domains 
from MSP-1 are more similar to the factor Xa structure and its close relative from Clr, 
than to each other or to the other structures tested. The rmsd values for MSP-1 domains 
compared to factor Xa are comparable to those of the more distantly related structures 
20 fibrillin- 1 and transforming growth factor-a. 

The overall fold of each MSP-1 domain is thus similar to typical EGF family members, 
with the turns following the fifth Cys residue roughly equivalent, in spite of the 
divergence from the EGF consensus (C(5)xxGa) where a is a Phe or Tyr -residue. 
25 Although some of the external loops are disordered, the scaffold is quite stable, as 
indicated by the non-exchangeable backbone amides (see above and in Protein Data 
Bank/BioMagResBank submission for details). 

Unlike many EGF-like domains such as fibrillin- 1, the MSP-1 C-terminal fragment lacks 
30 the conserved EGF Ca 2 "-binding sequence and there was no evidence of Ca 2+ binding to 
the MSP-1 C-terminal fragment. The 2D l H-NOESY spectra were virtually identical in 
the absence or presence of 20 mM CaCl 2 , indicating that any binding that might occur has, 
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at most, only a small affect the overall structure. 
Domain interface and surface 

The most striking feature of the MSP-1 C-terminal fragment structure is the interface 
5 between the domains, which consists of several nonpolar amino acids (Phel9, Leu31 ? 
Leu32 ? Leu86, Phe87, Ile90 and Phe91) involved in hydrophobic interactions. These 
residues join the base of the major P -sheet and the tight turn in domain- 1 with the final 
bend from residue 86 to 91 in domain-2. The domain interactions result in the domains 
forming a U-shaped structure which contrasts with structures observed for other pairs of 

10 EGF domains (Downing et al., 1996; Brandstetter et al. 9 1995). For example, in fibrillin- 1, 
the interface between EGF domains 32 and 33 is largely formed by a shared Ca 2 * ligation 
site (Downing et aL, 1996), and the overall structure resembles a rigid rod, with distant Isl- 
and C-termini. This contrasts with MSP-1 where the EGF-like domains are folded against 
each other so that their termini are relatively close together. A comparison of fibrillin- 1 

15 and MSP-1 EGF module pairs is shown in Figure 6. Although both termini of the MSP-1 
C-terminal fragment are somewhat disordered, NOE contacts were observed between 
nuclei in the two ends. The proximity of the C- and N-terminal positions may be 
significant, since it suggests that the proteolytic processing site that produces the C- 
terminal 96 amino acid fragment may be very close to the GPI membrane attachment site 

20 at or near residue 96. This proximity is consistent with the idea that a membrane-bound 
Plasmodium proteinase is responsible for secondary processing. 

The electrostatic potential surface of the MSP-1 C-terminal fragment is shown in two 
views in Figure 7. The surface in Figure 7a is highly charged, especially in the protruding 

25 loop regions 23 - 27, 35 - 40 and 64 - 66. The surface in Figure 7b contains more neutral 
hydrophilic residues as well as a small hydrophobic patch from Pro85 - Phe87 near the 
center of the surface. In the future, such information could assist in understanding how 
these different surfaces may be involved in interactions with the rest of the MSP-1 
precursor, the processing proteinase, other proteins on the merozoite surface, or unknown 

30 targets on the erythrocyte or parasite vacuolar membrane surfaces. 
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Primary sequence conservation 

The residues involved in the hydrophobic domain interface in P. falciparum are also 
shown in Figure 1. together with corresponding residues in MSP-1 of the less virulent 
5 human malaria parasite, P. vivax (Del Portillo et aL, 1991: Gibson et al. 9 1992). 
Extensive conservation of the interface residues (with conservative substitutions) suggests 
that P. vivax and perhaps other Plasmodium species as well, may have a similar U-shaped 
EGF module pair arrangement. Another feature of the P. vivax sequence, also seen in 
other Plasmodium species, is the single disulphide bond deficiency in the first EGF-like 
10 domain resulting from the absence of cysteine residues equivalent to the P. falciparum 
Cysl2 and Cys28. 

P. falciparum dimorphic sites 

15 Five dimorphic sites have been observed in the P. falciparum MSP-1 19 C-terminal 
fragment from different isolates (Qari et aL, 1998). Several observations can be made 
about the position of these sites on the MSP-1 structure. Two sites, Glnl4/Glul4 and 
Lys61/Thr61, involve residues in relatively well-structured backbone regions, with 
surface-exposed hydrophilic or charged side-chains. A pair of adjacent sites, with the 

20 sequence variants Asn70 - Gly71/Ser70 - Arg71, occurs in the disordered loop of domain- 
2, within a segment (residues 68-71) that has been shown to be highly mobile. The 
region from Glu65 to Lys73 also appears to be the most variable region among different 
Plasmodium species (Daly et al, 1992; Holder et aL, 1992). Finally, the fifth site has a 
substitution between hydrophobic residues (Leu86/Phe86). This partially-exposed side- 

25 chain is located at the hydrophobic domain interface, and the conservative substitution is 
consistent with a role in this interaction. 

Example 2 - Mutation and Monoclonal Antibody Binding studies 

30 As a step towards understanding antibody interactions with the MSP-1 C-terminal 
fragment, the effect of engineered point mutations (within domain- 1) on antibody binding 
has been studied. Amino acid substitutions were made that consisted of radical changes. 
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These radical changes consisted of. for example, replacing an aliphatic residue with a 
charged polar residue, replacing a positively charged side chain with a negatively charged 
side chain, replacing an amino acid with a large side chain with an amino acid with a 
smaller or no side chain (glycine), replacing a polar amino acid with a charged polar 
5 amino acid, replacing a polar amino acid with an aromatic amino acid, replacing a large 
aromatic amino acid with an amino acid with a small side chain, and replacing cysteine 
residues that are involved in disulphide bonds. 

Four individual amino acid substitutions shown in Figure 8, each completely abolish 
10 binding of one or more mAbs to the mutant fragment, as detected by Western blotting. 
The Glu26 mutation, shown in cyan, is closest to the N-terminal proteolytic processing 
site (magenta) at Asnl, and is the only one of this group of mutations that affects binding 
of a processing-inhibitory antibody, i.e. one that is capable of preventing both proteolytic 
processing of the MSP-1 precursor and erythrocyte invasion in vitro. The other three 
15 mutations abolish binding of blocking antibodies that bind to the native C-terminal 
fragment and interfere with the binding of processing-inhibitory antibodies. 

Additional mutations were made based on the immunochemical analyses and the tertiary 
structure of the molecule, and the binding of the mAbs was assessed by western blotting 
20 and BIAcore analysis. The results are summarised in Table 2. The results of the binding 
of selected mAbs to the modified proteins as detected by Western blotting are shown in 
Figure 9, and by BIAcore analysis in Figure 10. Some individual amino acid changes have 
no effect on the binding of any of the mAbs tested (for example Leu22 to Arg). Other 
substitutions affect the binding of one or more mAbs. 

25 

Of particular interest are those changes that prevent the binding of blocking antibodies but 
have no effect on the binding of the inhibitory antibodies. For example, replacement of 
Asnl 5 by Arg prevents the binding of mAb 7.5, replacement of Glu27 by Tyr prevents the 
binding of mAb 2.2, replacement of Leu31 by Arg prevents the binding of mAb 1E1, 
30 replacement of Tyr34 by Ser prevents the binding of mAb 7.5, and replacement of Glu43 
by Leu prevents the binding of mAb 111.4, 
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Several combinations of substitutions that prevent the binding of blocking antibodies but 
do not affect the binding of inhibitory antibodies' were made in single proteins (Table 2 
and Figure 1 1). In the first Glu27— >Tyr ? Leu31— >Arg and Glu43— >Leu were combined, in 
the second Glu27->Tyr ? Leu3 1 — ► Arg, Tyr34-»Ser ? and Glu43— »Leu were combined, and 
5 the third Asnl5->Arg ? Glu27-»Tyr, Leu31->Arg and Glu43->Leu were combined. None 
of these modified proteins bound any of the blocking antibodies but continued to bind the 
inhibitory antibodies. We propose that the mutant proteins will induce a polyclonal 
response that is more inhibitory than that induced by the wild type protein. 

10 The modified recombinant proteins will also be used to affinity select antibodies from 
pooled serum from individuals exposed to malaria. We hypothesise that the modified 
proteins will select less blocking antibody than the wild type protein and that therefore 
these selected antibodies will be more effective in inhibiting parasite invasion in vitro and 
secondary processing. 

15 

In the first EGF-like domain of MSP-1 from the rodent, primate and P. vivax malaria 
parasites, cysteines 2 and 4 are not present. We have replaced this cysteine pair (Cysl2 and 
Cys28) in the P. falciparum protein. This does not have appear to have any effect on the 
binding of any of the inhibitory antibodies, but does abolish the binding of the blocking 

20 antibody mAb 2.2. We propose that one reason why the proteins from these other malaria 
parasites are more immunogenic is that T cell recognition is more effective or that 
processing by antigen processing cells proceeds by a different degradation pathway that 
drives the fine specificity of the antibody response in a more productive direction (see for 
example Egan et aL, 1997). Removal of the cysteine pair may improve the 

25 immunogenicity of the modified protein and this will be assessed by measuring the level 
of antibodies induced by the P. falciparum protein without the two cysteines with the level 
of antibodies induced by the wild type protein. 
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TABLE A 



* 13-10-98 

# merozoite surface protein- 1 (MSP-1) Plasmodium falciparum (C-terminai fragment) 

# Reference: 'H : DSS=0.000 dioxane=3.755 (internal) 

* I5 N: indirect 

* l3 C: indirect 

f? 25C pH 6.5 50mM NaP04 lOOmM NaCI 90%H2O/10%D2O 

* —FORMAT— 

?? BioMagResBank 

ii 

rr 

# The original sequence entered was: 

NISQHQCVKKQCPQNSGCFRHLDEREECKCLLNYKQEGDKCVENPNPTCNENNGGCDADAKCTEEDSGSNGK 

KITCECTKPDSYPLFDGIFCSSSN 

# 

# Expressed in NMR-STAR, this sequence is: 
_Molj*esidue_sequence 

N I S QHQC VKKQC PQNSGCFR 
HLDEREECKCLLNYKQEGDK 
CVENPNPTCNENNGGCDADA 
KCTEEDSGSNGKKITCECTK 
PDSYPLFDGIFCSSSN 

loop_ 

_Residue_seq_code 
_Residue_author_seq_code 
Residue label 



1 


0 


ASN 


2 


0 


ILE 


3 


0 


SER 


4 


0 


GLN 


5 


0 


HIS 


6 


0 


GLN 


7 


0 


CYS ■ 


8 


i 


VAL 


9 


0 


LYS 


10 


0 


LYS 


11 


e 


GLN 


12 


0 


CYS 


13 


0 


PRO 


14 


0 


GLN 


15 


0 


ASN 


16 


0 


SER 


17 


0 


GLY 


13 


0 


CYS 


19 


0 


PHE 


20 


0 


. ARG 


21 


@ 


HIS 


22 


0 


LEU 


23 


0 


ASP 


24 


0 


GLU 


25 


0 


ARG 


26 


0 


GLU 


27 


0 


GLU 


28 


0 


CYS 


29 


0 


LYS 


30 


0 


CYS 


31 


0 


LEU 


32 


0 


LEU 


33 


0 


ASN 


34 


0 


TYR 


35 


0 


LYS 


36 


0 


GLN 


37 


0 


GLU 


38 


0 


GLY 


39 


0 


ASP 


40 


0 


LYS 


41 


0 


CYS 


42 


0 


VAL 


43 


0 


GLU 


44 


0 


ASN 


45 


0 


PRO 


46 


0 


ASN 


47 


0 


PRO 


48 


0 


THR 


49 


0 


CYS 


50 


0 


ASN 


51 


0 


GLU 


52 


0 


ASN 


53 


0 


ASN 


54 


0 


GLY 


55 


0 


GLY 


56 


0 


CYS 


57 


0 


ASP 


58 


0 


ALA 


59 


0 


ASP 


60 


0 


ALA 


61 


0 


LYS 


62 


0 


CYS 


63 


0 


THR 


64 


0 


GLU 


65 


0 


GLU 


66 


0 


ASP 


67 


0 


SER 


68 


0 


GLY 


69 


0 


SER 


70 


0 


ASN 


71 


0 


GLY 


72 


0 


LYS 


73 


0 


LYS 


74 


0 


ILE 


75 


0 


THR 


76 


0 


CYS 


77 


0 


GLU 


78 


0 


CYS 


79 


0 


THR 


80 


0 


LYS 


81 


0 


PRO 


82 


0 


ASP 


83 


0 


SER 


84 


0 


TYR 


85 


0 


PRO 


36 


0 


LEU 


87 


0 


PHE 


88 


0 


ASP 


89 


0 


GLY 


90 


0 


ILE 


91 


0 


PHE 


92 


0 


CYS 


93 


0 


SSR 


94 


0 


SER 


95 


0 


SER 



96 0 ASN 
stop_ 
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^^###########################^*######## 

# Chemical Shift Ambiguity Code Definitions =* 

2 Codes Definition ^ 

# 1 Unique ~ 
=? 2 Ambiguity of geminal atoms or seminal methyl £ 

# proton groups * 

# 3 Aromatic atoms on opposite sides of the ring U 

Z (e.g. Tyr HE1 and HE2 protons) # 

# 4 Intraresidue ambiguities (e.g. Lys HG and £ 

# HD protons) # 

# 5 Interresidue ambiguities (Lys 12 vs. Lys 27) # 

# 9 Ambiguous, specific ambiguity not defined # 



# INSTRUCTIONS 

# 1) Replace the @-signs with appropriate values. 

# 2) Text comments concerning the assignments can be supplied in the full deposition. 

# 3) Feel free to add or delete rows to the table as needed. 

# The row numbers (_Atom_shift_assign_ID values) will be re-assigned to sequential values by BMRB 

# # The atom table chosen for this sequence is: 
loop_ 

_Atom_shift_assign_ID 

_Residue_seq_code 

_Residue_label 

_Atom_name 

_Atom_rype 

_Chem_shift_value 

_Chem_shift_value_error 

_Chem_shift_ambiguity_code 

# 

Table A 1 . Supplementary: 1 H, 1 3C and 1 5N chemical shift assignments of MSP- 1 
C -terminal fragment 

Atom Residue 

shift Seq Residue Atom Atom Shift/ Error/ Ambiguity, 

assign no. Name Name Type ppm ppm Code 



1 


1 ASN 


H 


H 


8.29 


0.02 


1 


2 


1 ASN 


HA 


H 


4.60 


0.02 


i 


3 


1 ASN 


HB2 


H 


2.86 


0.02 


2 


4 


1 ASN 


HB3 


H 


2.75 


0.02 


2 


5 


I ASN 


HD21 


H 


? 


7 


7 


6 


1 ASN 


HD22 


H 


o 


7 




7 


1 ASN 


C 


C 


? 


7 




8 


1 ASN 


CA 


C 


55.5 


0.6 




9 


1 ASN 


CB 


C 


40.9 


0.6 




11 


1 ASN 


N 


N 


125.8 


0.3 




12 


I ASN 


ND2 


N 


7 


7 




13 


2 ILE 


H 


H 


8.29 


0.02 
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stop_ 

# The following loop is used to define sets of Atom-shift assignment IDs that represent related ambiguous 
assignments taken from the above list of assigned chemical shifts. Each element in the set should be 
separated by a comma, as shown in the example below, and is the assignment ID for a chemical shift 
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assignment that has been given as ambiguity code of 4 or 5. Each set indicates that the observed chemical 
shifts are related to the defined atoms, but have not been assigned uniquely to a specific atom in the set. 

loop_ 

_Atom_shift_assign_ID_ambiguity 



4 Sets of Atom-shift Assignment Ambiguities 

U 

4 Example: 5,4,7 
u 

rr 

stop_ 
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*=? — REMARKS — 



* PROTECTED BACKBONE AMIDE GROUPS 

~ (SLOWLY EXCHANGING IN D20) FOR RESIDUES: 

= GLY 17 , PHE 19 , GLU 27 , LYS 29 , LEU 3 1 , 

4 TYR 34 , LYS 35 , VAL 42 , CYS 56 , ASP 57 , 

4 ALA 60 , LYS 61 , THR 63 , THR 75 , GLU 77 , 

4 LEU86,GLY89,ILE90,PHE91 
-i 

IT 

4 BROAD HN SIGNALS IN [15-N]-HSQC OBSERVED FOR RESIDUES: 

* VAL 8 , LYS 9 , LYS 10 , CYS 18 , ARG 20 

jj 

rr 

U TWO BACKBONE HN CROSSPEAKS OBSERVED FOR RESIDUES: 

* HIS 5 : 7.78,113.8/7.74,113.5 

* GLN 6 : 7.44,122.6 / 7.40,122.4 

4 TWO AVERAGED NHVHH* SIGNALS OBSERVED FOR RESIDUES: 

4 ARG 20 , ARG 25 : NOT SPECIFICALLY ASSIGNED 

* TO INDIVIDUAL ARGININES 

rr 

5 LYS 29 NZ/HZ' SIGNAL: TENTATIVELY ASSIGNED TO 
S LYS 29 (BURIED LYSINE SIDE CHAIN) 

* BASED ON GREATER PROTECTION FROM H20 EXCHANGE 
» THAN OTHER LYSINE NZ/HZ* SIGNALS 

iJ 
fr 

* ASPARAGINE SIDE CHAIN AMIDE SIGNALS: 

* PROBABLE OVERLAPPING CROSSPEAKS - 1 1 2 PPM [ ,5 N] 

* FOR ASN 1 , ASN 70 , ASN 96 

u 
rr 
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Exampie 3: Identification of blocking antibodies using a competitive binding assay 
and immobilised wild type GST-MSP-I19. 

In previous studies antibodies that blocked the action of the neutralising antibodies 12.8 
5 and 12. 10 had been defined either directly in the MSP- 1 42 processing assay (Blackman et 
ai. 1994) in a coupled erythrocyte invasion-MSP-l 4 2 processing assay (Guevara et ai, 
1 997) or in a competitive radioimmunoassay with merozoite protein as the antigen 
(Guevara etaL. 1997). These studies have been extended using recombinant MSP-1 and 
BIAcore analysis. 

10 

A recombinant fusion protein comprising wild type MSP-1 19 fused to GST was coupled to 
the sensor chip and competitor antibody was first allowed to bind to the antigen. Then a 
solution of either mAb 12.8 or 12.10 was passed over the chip and the amount of binding 
of this second antibody was quantified. If the first antibody interferes with the binding of 
15 the second antibody then this is reflected in a reduction in the amount of second antibody 
bound. 



Methods 

The wild type GST-MSP-1 19 was coupled to a CMS sensor chip. The binding assays were 
20 performed with a constant flow rate of 5 |^1 min" 1 at 25°C. For binding, purified mAbs 
1E1, 8A12 and 2F10 at 100 ug ml" 1 in HBS-EP buffer (lOmM HEPES pH7.4 containing 
150mM NaCl, 3mM EDTA and 0.005%v/v polysorbate 20); mAbs 1E8, 9C8, 12D11, 
111.2 and 111.4 in cell culture medium supernatant; mAbs 2.2, 7.5 and 89.1 at 1:10 
dilution of ascitic fluid in HBS-EP buffer; and mouse a-GST antibody at 1:10 -dilution 
25 serum in HBS-EP buffer were allowed to interact with immobilised wild type GST-MSP- 
li 9 for 10 min. After allowing 5 min for dissociation of low affinity interactions, either 
mAb 12.8 or 12.10 at 100 ^ig ml" 1 in HBS-EP buffer was added and allowed to bind for 10 
min. After washing the chip for 5 min the binding of 12.8 or 12.10 was measured. The 
chip vvas regenerated by washing off bound antibody with 10 mM glycine-HCl, pH 2.4. or 
30 when required with 100 mM glycine-HCl, pH 1 .8, for 3 min. 
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Results 

The results are shown in Figure 12. All the competitor antibodies bind to the GST-MSP- 
1 19 antigen, with the exception of mAb89.1 which is a negative control. As expected, 
5 mAbs 12.8 and 12. 10 competed with each other (Guevara et ai, 1 997). The other 
antibodies which do not inhibit processing could to greater or lesser interfere with the 
binding of 12.8 and 12.10. As expected from previous studies mAbs 1E1, and 7.5 
blocked both 12.8 and 12.10, whereas 2.2 and 1 1 1.4 blocked 12.8. Another particularly 
effective blocking antibody identified in this study was mAb9C8. 

10 

Example 4: Immunization of small animals with modified GST-MSP-l^and analysis 
of the antibodies induced 

To determine whether or not the modified proteins were immunogenic, recombinant GST- 
15 MSP-1 19 fusion proteins were used to raise antibodies by immunisation. 

Methods 

Two modified proteins containing either 3 [27+31+43] or 4 [15+27+31+43] amino acid 
substitutions, respectively, were used to immunise rabbits and mice. The rabbits were 
20 immunised subcutaneously with MSP-1 19 protein in Freund's complete adjuvant and then 
boosted on three occasions with 200 fig of the protein in Freund's incomplete adjuvant 21, 
42 and 63 days later, and serum samples were collected. 

The presence and level of antibodies binding to the native MSP-1 protein in the parasite 
25 was assessed by indirect immunofluorescence using acetone fixed smears of parasite- 
infected erythrocytes. The sera were diluted serially in phosphate buffered saline (PBS) 
and incubated on the slide for 30 min at room temperature. After washing, the slides were 
incubated with FITC conjugated goat anti-rabbit or anti-mouse IgG, washed, and then 
examined by fluorescence microscopy. 

30 

The sera were also analysed in an MSP-1 secondary processing assay. Analysis and 
quantitation of secondary processing of MSP-1 in merozoite preparations was by a 
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modification of an assay described previously (Blackman er ai, 1994). Washed P. 
falciparum 3D7 merozoites were resuspended in ice-cold 50 mM Tris-HCl pH 7.5 
containing 10 mM CaCU and 2 mM MgCb (reaction buffer). Aliquots of about 1 x 10 9 
merozoites were dispensed into 1.5 ml centrifuge tubes on ice, and the parasites pelleted 
5 in a micro fuge at 13,000 x g for 2 minutes at 4°C. The supernatant was removed, and 
individual merozoite pellets were then resuspended on ice in 25 jil of reaction buffer 
further supplemented with protease inhibitor or antibodies as appropriate. Merozoites 
were maintained on ice for 20 min to allow antibody binding, then transferred to a 37°C 
water bath for one hour to allow processing to proceed. Assays always included the 

10 following controls: a "positive processing" control sample of merozoites resuspended in 
reaction buffer only; a "negative processing" control sample of merozoites resuspended in 
reaction buffer plus ImM PMSF; and a zero time (Oh) control, in which processing was 
stopped before the 37°C incubation step. The processing was assayed using the western 
blot-based method and by a modified processing assay. Supernatants from the assays 

15 were obtained after centrifugation for 30 min at 4°C, 13,000 x g to remove the insoluble 
material. The amount of MSP- 1 33 in the supernatants was measured using an ELISA 
method. Fifty microlitres of diluted sample supernatants were added to the wells of an 
ELISA plate (NUNC F96 Cert. Maxisorp) that had been coated with 100 nl/well of 4 ng 
ml" 1 human mAb X509 in PBS. Plates were incubated for 4 hours at 37°C and then 

20 washed 3 times with 0.01% PBS-Tween (PBS-T). Bound MSP-I33 protein was detected 
by addition of 100 jil of 1 :4000 dilution of mouse mAb G 13 for 1 hour at 37°C, followed 
by washing and the addition of 100 fil of 1:1000 dilution of sheep anti-mouse IgG (H+L) 
HRP-conjugated antibody. After incubation for 1 h at 37°C, the plates were washed again 
and HRP was detected by the addition of 100 nl of freshly prepared substrate solution 

25 (400mg l" 1 o-phenylenediamine dihydrochloride in 0.05 M phosphate buffer, 0.024 M 
citric acid and 0.012% H2O2) at room temperature for 20 min. The reaction was stopped 
by adding 10 (il of 1 M sulphuric acid and the absorbance of each sample was measured at 
492 nm. 

30 Results 

The results are shown in Figure 13. The two modified proteins produced antibodies that 
reacted with MSP-1 in the parasite-infected erythrocyte, with a serum titre of 1:10.000, 
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which was an identical titre to that of a serum produced in the same way by immunisation 
with a recombinant protein containing the wildtype MSP-1 sequence. This indicates that 
the modified proteins can produce antibodies that react with the native protein. The 
antibodies induced by immunization were able to partially inhibit processing at the 
5 concentration used in a preliminary experiment, whereas in the control serum no 
antibodies that inhibited processing were present. 

Example 5: Design and Synthesis of a Plasmodium Falciparum Merozoite Surface 
Protein-l Gene Fragment Optimized for Pichia Pastoris Heterologous Expression 

10 

The coding sequence of the Plasmodium falciparum merozoite surface protein- 1 (MSP1) 
41.1 kDa processed fragment (MSP- 142) has been redesigned for optimal heterologous 
expression in the yeast Pichia pastoris. The optimized DNA sequence was synthesized by 
PCR gene assembly, in the form of two fragments, MSP-1 33 and MSP-1 19. P. paswris 
15 was transformed with an expression vector containing the optimized MSP-1 19 construct. 
Recombinant strains were shown to express high levels of non-glycosylated, properly 
folded MSP-1 19 protein. 

Proteins encoded by the AT-rich genome of the human malaria parasite Plasmodium 
20 falciparum are generally poorly expressed in heterologous systems (Withers-Martinez et 
al., 1999). The methylotrophic yeast Pichia (Komagataella) pastoris is an appropriate 
system for expression of disulphide-bridged proteins such as the C-terminal fragment of 
the P. falciparum merozoite surface protein- 1 (MSP1) (White et al M 1994; Morgan et al., 
1999). In the P. pastoris expression system, it is important to avoid premature 
25 transcription termination due to AT-rich stretches (Romanos et al., 1991). Codon 
preferences for highly expressed genes in P. pastoris have been identified (Sreekrishna et 
al., EP 0 586 892 Al). Therefore, a synthetic MSP-142 gene fragment with codon usage 
optimized for P. pastoris expression was designed, using novel computer software 
(Withers-Martinez et al., 1999). It has previously been shown that the MSP-1 19 fragment 
30 is partially glycosylated when expressed in P. pastoris, and the carbohydrate must be 
enzymatically removed during purification (Morgan et al., 1999). Therefore, two specific 
point mutations were introduced into the synthetic MSP-142 protein sequence in order to 
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prevent N-linked glycosylation at NxS/T sites (one potential site within the MSP-133 
sequence, and a known site within the MSP-1 19 sequence at Asn 1). 

The optimized MSP- 142 sequence was synthesized by gene assembly polymerase chain 
5 reaction (Stemmer et al.. 1995, Withers-Martinez et al. r 1999), in the form of separate 
MSP-133 and MSP-1 19 fragments. The optimized MSP-1 19 fragment was subcloned into 
a novel modified Pichia expression vector, transformed into the P. pastoris host strain 
SMD1168. and several independent transformants were isolated. There transformants 
were shown to efficiently express non-glycosylated, properly folded MSP-1 19. Strong 
10 expression of the optimized gene was observed in low copy number transformants. A 
multiple copy transformant with intermediate level G418 resistance gave expression of 
purified MSP-1 19 at a level equal to the high-expressing strain previously described 
(Morgan et al., 1999), which contains the original P. falciparum DNA. Thus, it should be 
possible to obtain even higher yields from high level G418-resistant transformants of the 
15 synthetic optimized gene. 



Methods: Gene Assembly 



The P falciparum MSP- 142 (41.1 kDa) fragment protein sequence SWISS-PROT 
20 accession number P04933: positions 1264-1621) was first altered to eliminate N-linked 
glycosylation signals by 2 amino acid substitutions. The sequences NYT (in the N- 
terminal portion; position 1445) and NIS (at the beginning of the C-tenninal fragment; 
position 1526) were changed to QYT and NLA respectively. The protein sequence was 
then reverse-translated with DNA-STAR using the S. cerevisiae codon preferences. This 
25 sequence was used as input for the CODOP program (Withers-Martinez et al., 1999). Ten 
random sequences were generated with this program, using a codon weighting table 
(Figure 14) derived from codon usage in highly expressed P. pastoris genes (Sreekrishna 
et al., EP 0 586 892 Al). Thus, the codon table should reflect usage in highly expressed 
genes, rather than average usage. The random sequence that contained the minimum 
30 number of unfavourable codons (6) was selected, and these codons were changed 
manually to more preferred alternatives. The sequence was then analysed with DNA- 
STAR to check for AT-rich sequences that may cause transcription termination, and for 
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direct and invened repeats. A set of 50 overlapping oligonucleotides coding for the final 
sequence was then generated. This consisted of 49 oligonucleotides of length 42 nt, and 
one of length 48 nt. Each oligonucleotide had a 21 bp overlap with its neighbours, with no 
gaps. Estimated T m s were in the range of 60°C to 77°C. Oligonucleotides were synthesised 
5 by Oswel (Southampton, UK) at 40 nmol scale, and supplied in deionised water without 
purification. Outside primers of various lengths for the amplification step were also 
synthesised, to give a T m of 62°C to 64°C, and contained a 5' -terminal phosphate group 
for ligation following the amplification step. The reverse primers also included a 
translation termination codon (UAA in the complementary strand). All oligonucleotides 
10 were diluted to 10 |iM in ddH 2 0 before use. 

The PCR-mediated gene assembly and amplification were carried out as described 
(Stemmer et aL 1995; Withers-Martinez et al., 1999), using a Biometra cycler, in thin- 
walled 200 ^iL tubes, under the following conditions. 

15 

Gene assembly reactions (Reaction 1): 

50 \xL volume 

2 units Vent polymerase (New England Biolabs) 
20 0.4 mM dNTPs 

1 x Vent polymerase buffer 

Oligonucleotide mix containing each oligonucleotide at 200 nM 

Cycles: 

25 32 cycles (2 h 33 m) 

denaturation 94°C 30 s 
annealing 52°C30s 
extension 72°C 3 m 



30 Three fragments of the MSP- 142 (41.1 kDa) region were synthesised separately with 
different outside primers and subsets of the 50 oligonucleotide set: 
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N-terminal fragment (bp 1-423) 21 oligonucleotides 
middle fragment (bp 337-786) 22 oligonucleotides 
C-terminal fragment (bp 787-1074) 14 oligonucleotides 

5 The C-terminal fragment produces a 10.6 kDa fragment (MSP-1 19). The N-terminal and 
middle fragments, which overlap between positions 337 and 423, were subsequently 
spliced together at the Bglll site (371-376) to give a 786 bp fragment that encodes the 30.5 
kDa MSP-1 33 protein. 

10 Amplification reactions (Reaction 2): 

1 00 jjJL volume 

10 jiL aliquot of the gene assembly reaction 
4 units Vent polymerase 
15 0.4 mM dNTPs 

1 x Vent polymerase buffer 
1 |iM outside primers 

Cycles: 

20 32 cycles (2 h 55 m) 

denaturation 94°C 45 s 
annealing 52°C 45 s 
extension 72°C 3 m 
final extension 72°C 5 m 

25 

The PCR products were then purified by filtration with Centricon-100 units (Amicon), 
and cloned directly into the vectors by blunt-end ligation overnight at 1 6°C with T4 DNA 
ligase. 

30 The synthetic MSP-1 19 gene was cloned directly into a P. pastoris expression vector. The 
modified pPIC9KHXa vector, containing a His6 tag and factor Xa cleavage site (see 
Figure 1 5) inserted in the pPIC9K SnaBI site, had been digested with Pmll and treated 
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vvith calf alkaline phosphatase. The HXa vector had been previously created by insertion 
of a 36 bp synthetic oligonucleotide, containing the His 6 tag ? factor Xa cleavage site, and 
Pmll restriction site into the SnaBI site of the pPIC9K vector. 

5 The N-terminal and middle fragment PCR products were cloned into the Smal site of the 
dephosphorylated pUC118 vector. Plasmid clones containing inserts were sequenced. 
Clones with the correct synthetic sequence were then digested and the two fragments were 
gel-purified. The N-terminal fragment clones were digested with EcoRI and Bglll, and the 
middle fragment clones were digested with Hindlll and Bglll. The recombinant fragments 

10 were purified on an agarose gel, and eluted with a QIAGEN extraction kit. The purified 
N-terminal and middle fragments were then spliced together by ligation into a pUC118 
vector that had been digested with Hindlll and EcoRI and treated with calf alkaline 
phosphatase. This created the complete synthetic MSP- 133 coding sequence. The N- 
terminal and middle fragment PCR products were cloned into the Smal site of the 

15 dephosphorylated pUC118 vector. Plasmid clones containing inserts were sequenced. 
Clones with the correct synthetic sequence were then digested and the two fragments were 
gel-purified. The N-terminal fragment clones were digested with EcoRI and Bglll, and the 
middle fragment clones were digested with Hindlll and Bglll. The recombinant fragments 
were purified on an agarose gel, and eluted with a QIAGEN extraction kit. The purified 

20 N-terminal and middle fragments were then spliced together by ligation into a pUC118 
vector that had been digested with Hindlll and EcoRI and treated with calf alkaline 
phosphatase. This created the complete synthetic MSP- 133 coding sequence. 

Methods: Expression and Purification 

25 

The methylotrophic yeast Pichia (Komagataella) pastoris strain SMD1168 was 
transformed by electroporation as described previously (Morgan et al., 1999). In addition, 
some G4 18 -resistant clones were isolated using Hybond-N+ membranes (Fairlie et al.. 
1999). 

30 

Expression screening of transformants -was performed by growing 10 ml cultures in 
buffered minimal glucose medium. Cells were harvested and resuspended in 10 ml 
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buffered minimal methanol medium at 1.0 OD 6 oo and grown overnight to a final OD 6 oo of 
2.5 to 3.0. Cells were removed by centrifugatiomand 1.2 mi of the supernatant medium 
was precipitated 30 min on ice with 15 % trichloroacetic acid. The samples were 
centrifiiged for 30 min at 14000 rpm at 4 °C in a micro fuge, and the protein pellets were 
5 washed twice with cold acetone. Samples were resuspended in 12 pi ddHiO, and 5 ul was 
electrophoresed, after reduction with DTT, on NOVEX pre-poured acrylamide gels 
according to manufacturer's instructions. NOVEX 4-12 % acrylamide gradient, or 10 % 
acrylamide, Bis/Tris gels in MES buffer were used. Protein gels were stained with 
Coomassie colloidal Brilliant Blue stain (Sigma). 

10 

Homogeneously purified MSP-1 19 was obtained as described previously (Morgan et aL 
1 999), except that enzymatic deglycosylation was omitted for the synthetic gene products. 

Methods: NMR 

15 

One-dimensional *H- and 2-dimensional { l H/ l5 N}-HSQC spectra were acquired as 
described previously (Morgan et aL, 1999), at 25 °C, at sample concentrations of 1.1-2.5 
mM. 

20 Results 

The sequences of the synthetic DNA fragments, and the resulting predicted protein 
products, are shown in Figure 15. A summary of the resulting improvements to the 
sequence is given in Table 3. 
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Total codons P. pastoris preferred Unfavourable % AT content 
codons codons 



P. falciparum MSP1 358 140 28 74 

41.1 kDa fragment 

Synthetic 41.1 kDa 358 276 0 58 

fragment 

Table 3. Codon usage 

5 

PCR-gene assembly reactions for the MSP- 133 (two sections) and MSP-119 synthetic 
fragments are shown on agarose gels in Figure 1 6. This demonstrated that a single, correct 
size major product was observed in each case. The PCR products were subcloned, 
screened, and sequenced as described in the Methods section. 

10 

P. pastoris was transformed with the synthetic MSP-119 construct in the modified 
pPIC9K expression vector (pPIC9K-HXa; Figure 15). Expression of the synthetic MSP- 
119 product in three independent transformants is shown on a protein gel in Figure 17. 
The protein samples were prepared by trichloroacetic acid precipitation from culture 

15 supernatants as described in the Methods section. This demonstrated that a single, major 
product was present in each sample, corresponding to the expected migration of the 
synthetic MSP-119 protein. This migrated slightly more slowly than the control .sample, 
which as described previously (Morgan et al., 1999) has a shorter N-terminal tag 
sequence. There was no trace of heterogeneous, slowly migrating recombinant protein that 

20 would result from glycosylation. Therefore, non-glycosylated, synthetic MSP-119 is 
efficiently expressed by the transformed yeast. The yield (measured by UV absorbance) 
of purified MSP-119 was 16 mg/L for low copy number transformants (resistant to 0.25 
mg/ml G418), and increased to 24 mg/L for intermediate G418 resistance (resistant to 1.0 
mg/ml G418). This can be compared with yields of 1-2 mg/L for low copy number 

25 transformants of P. pastoris with the original Plasmodium falciparum coding sequence. 
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before isolation of a highly G418-resistant strain (Morgan et al.. 1999). This indicated 
that the synthetic MSP-1 19 construct is advantageous for recombinant protein expression, 
and that further improvement would result from isolation of higher copy number 
transformants. 

5 

One-dimensional proton NMR experiments demonstrated that the synthetic MSP-119 
protein spectrum was very similar to the previously studied protein (Morgan et al., 1999), 
and represented a correctly folded protein (data not shown). This was further confirmed 
by a 2D-{ I H/ 15 N}-HSQC spectra (Figure 18), which also shows that the structure of the 

10 synthetic product is identical to the previously studied protein, except for slight 
differences at the N-terminus which are consistent with the presence of a distinct N- 
terminal tag sequence, and S3->A mutation at the glycosylation site. Backbone NH 
proton and l5 N chemical shifts for the original P. falciparum sequence product have been 
previously presented (Morgan et al., 1999). The similarity between the two spectra, 

15 outside of the N-terminal region, is strong evidence that both protein forms are in a 
structurally similar, correctly folded state. 
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CLAIMS 

1. A non-naturally occurring variant of a C-terminal fragment of a Plasmodium 
merozoite surface protein- 1 (MSP-1) wherein said variant has (i) a reduced affinity, 

5 compared with a naturally occurring Plasmodium MSP-1 19, for at least one first antibody 
capable of blocking the binding of a second antibody, which second antibody inhibits the 
proteolytic cleavage of Plasmodium MSP-1 42 and (ii) substantially the same affinity for at 
least one third antibody compared with said naturally occurring Plasmodium MSP-1 19. 
which third antibody inhibits the proteolytic cleavage of Plasmodium MSP-1 42- . 

10 

2. A variant according to claim 1 wherein said Plasmodium MSP-1 19 and MSP-1 42 
are Plasmodium falciparum MSP-1 19 and MSP-I42. 

3. A variant according to claim 1 or 2 wherein the first antibody is selected from 
15 mAbs lEl,2.2,7.5,9C8and 111 A 

4. A variant according to claim 1 or 2 wherein the second and/or third antibody is 
selected from mAbs 12.8, 12.10 and 5B1. 

20 5. A non-naturally occurring variant of a C-terminal fragment of a Plasmodium 
merozoite surface protein- 1 (MSP-1) comprising an amino acid modification at any one of 
amino acid residues 14, 15, 27, 31, 34 , 43, 48 and 53 of the Plasmodium falciparum 
MSP-1 19 amino acid sequence shown as SEQ I.D. No. 1 or their equivalent positions in 
other Plasmodium MSP-1 19 polypeptides. 

25 

6. A variant according to claim 5 wherein said modifications are selected from 
Glnl4-»Arg, Glnl4->Gly, Asnl5->Arg, Glu27->Tyr, Leu31-»Arg, Tyr34-*Ser, 
Tyr34->Ile, Glu43->Leu, Thr48-»Lys and Asn53->Arg and their equivalents in other 
Plasmodium MSP-1 19 polypeptides. 

30 

7. A variant according to claim 6 wherein said substitutions are selected from 
[Glu27-»Tyr, Leu31-+Arg and Glu43-»Leu], [Glu27->Tyr, Leu3 1 -» Arg, Tyr34-»Ser 
and Glu43-»Leu], [Asnl5-»Arg, Glu27->Tyr, Leu31-»Arg and Glu43-*Leu] and their 



NSOOCIO: <WO 0063245A2 I > 



WO 00/63245 



PCT/GB00/01558 



-104- 

equivalents in other Plasmodium MSP-1 19 polypeptides. 

8. A variant according to any one of claims 5 to 7 further comprising a mutation at 
Cysl2 and/or Cys28 of the Plasmodium falciparum MSP-1 19 amino acid sequence shown 

5 asSEQI.D.No. 1. 

9. A variant according to claim 8, in which the mutation is a substitution selected 
from Cysl2— He and Cys28— Trp, and Cysl2— Ala and Cys28— Phe. 

10 10. A variant according to claim 9, in which the substitutions are selected from 
[Cysl2— He, Asn 15— Arg, Glu27— >Tyr, Cys28— Trp, Leu31— Arg, Glu43— Leu], 
[Cysl2— lie, Asn 15— Arg, Glu27— Tyr, Cys28— Trp, Leu31 — Arg, Glu43— Leu, 
Asn53— Arg], [Cysl2— He, Asn 15— Arg, Glu27— Tyr, Cys28— Trp, Leu31— Arg, 
Tyr34— ►Sen Glu43— Leu, Asn53— Arg] and their equivalents in other PlasmodiumMSP- 

15 1 19 polypeptides. 

11. A method for producing a Plasmodium MSP-1 variant for use in preparing a 
vaccine composition which method comprises modifying one or more amino acid residues 
of a Plasmodium MSP-1 C -terminal fragment such that the resulting derivative has (i) a 

20 reduced affinity, compared with a naturally occurring Plasmodium MSP-1 19, for at least 
one first antibody capable of blocking the binding of a second antibody, which second 
antibody inhibits the proteolytic cleavage of Plasmodium MSP-1 42 and (ii) substantially 
the same affinity for at least one third antibody compared with said naturally occurring 
Plasmodium MSP-1 19, which third antibody inhibits the proteolytic cleavage of 

25 Plasmodium MSP- 1 42. 

12. A non-naturally occurring Plasmodium MSP-1 variant obtained by the method of 
claim 1 1 . 

30 13. A polynucleotide encoding a variant according to any one of claims 1 to 10 or 12 
operably linked to a regulatory sequence capable of directing the expression of said 
nucleotide in a host cell. 
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14. 14. A polynucleotide according to claim 13 having a sequence optimised for 
expression in said host cell. 

15. A polynucleotide according to claim 13 or 14 ? in which the host cell is a Pichia 
5 pastoris cell. 

16. A nucleic acid vector comprising a polynucleotide according to claim 13, 14 or 15. 

17. A host cell comprising a vector according to claim 1 6. 

10 

1 8. A pharmaceutical composition comprising a variant according to any one of claims 
1 to 10 or 12, a polynucleotide according to any one of claims 13 to 15 or a vector 
according to claim 16 together with a pharmaceutical^ acceptable carrier or diluent. 

15 19. A composition according to claim 18 further comprising an immunogenic 
Plasmodium polypeptide or fragment or derivative thereof. 

20. A method for producing anti-MSP- 1 antibodies which method comprises 
administering a polypeptide according to any one of claims 1 to 10 or 12, a polynucleotide 

20 according to any one of claims 13 to 15 or a vector according to claim 16 to a mammal. 

21. A method for producing polyclonal anti-MSP- 1 antibodies which method 
comprises administering a polypeptide according to any one of claims 1 to 10 or 12, a 
polynucleotide according to any one of claims 13 to 15 or a vector according to qjaim 16 

25 to a mammal and extracting the serum from said mammal. 

22. An antibody produced by the method of claim 20 or 2 1 . 

23. A method of inducing immunity against malaria induced by Plasmodium 
30 falciparum which comprises administering to a person in need of such immunity an 

effective amount of the variant of claim 1, the polynucleotide of claim 13 or the vector of 
claim 16. 
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24. A method of immunizing a mammal, said method comprising administering an 
effective amount the polypeptide of claim L the polynucleotide of claim 13 or the vector 
of claim 16. 

5 

25. The method of claim 24, wherein said mammal is immunized against malaria. 

26. A method of treating a malaria infection in a human patient which comprises 
administering to the patient an effective amount of the pharmaceutical composition of 

!0 claim 18 or 19. 

27. A variant according to any one of claims 1 to 10 or 12, a polynucleotide according 
to any one of claims 13 to 15 or a vector according to claim 16 for use in therapy. 

15 24. A computer readable medium having stored thereon a model of the MSP-1 1 9 NMR 
structure. 

25. A nucleic acid encoding a Plasmodium MSP-1 polypeptide, in which the nucleic 
acid is optimised for expression in a heterologous host cell. 

20 

26. A nucleic acid according to Claim 25, in which the heterologous host is a Pischia 
pastoris cell. 

27. A nucleic acid according to Claim 25 or 26, in which the polypeptide is selected 
25 from an MSP-1 42 polypeptide comprising a sequence shown in Figures 2C and 2E, an 

MSP-1 19 polypeptide comprising a sequence shown in Figure 2C, and an MSP-I33 
polypeptide comprising a sequence shown in Figure 2E. 

28. A nucleic acid according to any of Claims 25 to 27, in .which the nucleic acid 
30 comprises a sequence selected from the sequences of Figure 2 A, Figure 2B and Figure 2D. 
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29. A nucleic acid vector comprising a nucleic acid according to any of Claims 25 tp 
28. 

30. A host cell comprising a vector according to claim 29. 

31. A pharmaceutical composition comprising a nucleic acid according to any of 
Claims 25 to 27 or a vector according to Claim 29 together with a pharmaceutical^ 
acceptable carrier or diluent. 

32. A composition according to Claim 31 further comprising an immunogenic 
Plasmodium polypeptide or fragment or derivative thereof. 
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= Coiion Usage Table for Komagataella (Pichiai pastoris 
== for use with CODOP program 

= Fields: [amino acid] [triplet] [frequency per thousand] fnumber] 
s codon preferences weights: 

= 30 preferred uniquely 

= 20 preferred alternative 

- 10 second best alternative 

i 1 5 ruo equally preferred codons 

= 1 unclassified 

P 0 forbidden codons 

# codon selection list 

# 

# Phe TTC 

# Leu TTG > CTG 

# He ATC = ATT and not ATA 

# Met ATG 

# Val GTT > GTC and not GTA 
U Ser TCT = TCC 

# Pro CCA > CCT and not CCG 

# Thr ACT = ACC 

# Ala GCT > GCC and not GCG 
#Tyr TAC and not TAT 

# * UAA 

# His CAC > CAT 

# Gin CAA > CAG 
U Asn AAC 
U Lys A AG 

# Asp GAC > GAT 
#Glu GAA = GAG 

U Cys TGT > TGC 

n Trp TGG 

# Arg AGA and not CGC . CGA . CGG 
#Glv GGT > GGA and not GGG 



PheUUU l.0( 1) 

Phe UUC 30.0( 30) 

LeuUUA 1.0( 1) 

Leu UUG 20.0( 20) 



LeuCUU 1.0( 
Leu CUC 1.0( 
LeuCUA l.0( 
Leu CUG 10.0( 

He AUU 15.0( 
He AUC 15.0f 
lleAUA 0.0( 
Met AUG 30.0f 

Val GUU 20.0( 
Val GUC 10.0( 
Val GUA 0.0( 
ValGUG 1.0( 



1) 
1) 
0 
10) 

15) 
15) 
0> 
30) 

20) 
10) 
0) 
1 ) 



Ser UCU 15.0( 
Ser UCC 15.0( 
Ser UCA 1 .Of 
Ser UCG 1.0( 

Pro CCU 10.0( 
Pro CCC 1.0( 
Pro CCA20.0f 
Pro CCG 0.0( 

Thr ACU 15.0( 
Thr ACC I5.0( 
Thr ACA 1 .0( 
Thr ACG ].0(" 

Ala GCU20.0r 
Ala GCC 10.0( 
Ala GCA l.Of 
Ala GCG O.Of 



15) Tvr UAU 0.0( 
15) Tyr UAC 30.0( 
1 ) Ter UAA 30.0( 
1) Ter UAG 0.0( 

10) His CAU 10.0( 
H His CAC20.0( 
20) Gin CAA 20.0( 

0) Gin CAG 10.0( 

15) Asn AAU l.0( 
15) Asn AAC30.0( 

1) Lys .AAA I.0( 
1) Lys AAG30.0( 

20) Asp GAU 10.0( 
10) Asp GAC20.0( 
1) Glu GAA 15.0f 
0) Glu GAG I5.0( 

FIG. 14 



0) 


Cys UGU 20.0( 


20) 


30) Cys UGC I0.0( 


10) 


30) 


Ter UGA 0.0( 


0) 


0) 


Trp UGG 30.0( 


30) 


10) 


Arg CGU l .Of 


1) 


20) 


Arg CGC 0.0( 


0) 


20) 


Arg CGA 0.0( 


0) 


10) 


Arg CGG 0.0( 


0) 


1) 


Ser AGU l.Of 


1) 


30) 


Ser AGC l.Of 


n 


1) 


Arg AGA 30.0( 


30) 


30) Arg AGG l.Of 


1) 


10) 


Gly GGU20.0( 


20) 


20) Gly GGC I.Ot 


n 


15) 


Gly GGA 10.0( 


10> 


15) 


Gly GGG O.Of 


0) 
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LO 
C3 



U XJ CJ 03 XJ 

C* 03 03 CP 03 
4-j ra xJ 03 CP 
(J CP CJ 03 xj 

4-J XJ XJ XJ U 

O f iJ t) 

O xj O 03 cn 

CP 4-J XJ CJ 05 

03 03 4-> 01 03 

03 u u j-> a 

CP 03 0J CJ 03 

XJ Cn CP 03 XJ 

XJ CP 4J 03 CP 

CJ 03 4-J U 03 

03 03 4-» 03 03 

4-J U 03 4-J CP 

C*P XJ CP 4-* 03 

4-J CP CP O CP 

4-J O 03 XJ XJ 

o 03 03 u a 

4-J 03 CJ CJ 4-J 

CP 4-J 03 4-J CP 

CP 4-J 4-J CP-UJ 

03 CP 03 4-J 4-J 

CPUU4J u 
CJ 03 U CJ 4-J 
03 03 xJ xj 03 

XJ O XJ XJ CP 
03 4-J 03 CP 03 
03 4-> CP 03 03 
CP 4-J 4-J 03 O 
O U 4-> O 03 
03 03 CJ 03 XJ 
03 CP 03 CP CJ 
CP XJ CP 03 03 
03 03 XJ CP 4-» 
CP U O ^ a-J 
XJ xj xj CP CP 
4-J CP CP 03 CJ» 
03 O 03 03 CP 
CP 03 CP CPXJ 
m 03 CP 03 u 

m u xj cp u 

XJ 03 O CP 4-J 

U 03 4-J 03 CP 

4-J 03 4-J 03 u 

CP 03 CP U 03 

4-> CP O 03 CP 

O CP 03 05 CJ 

4-J XJ 03 CP 03 

4-J U CP 4-J 03 

05 05 03 CJ XJ 

CJ 05 03 CJ CJ 

03 CJ CJ 4-J CP 

03 CP 4-J xJ U 

4-J 03 4-J CP 4-J 

03 03 U 03 4-J 

CP CP 05 03 O 

U 03 03 U 03 

xJ 05 03 03 05 

0? CP 03 xJ CJ 

4-J 4-J CP 03 4-J 

4-> CJ 03 U 03 

CP CJ CP CJ xj 

4-» O 03 U 03 

4-J CT> 03 CP 

03 03 CP 4-J 

CP 03 CP U 

_ 03 CJ 03 03 

CJ CJ 03 03 xj 

XJ 03 03 4-> 03 

CJ J-> CJ XJ CP 

03 CJ XJ CP J-J 
X-* 4-> XJ 4-J 4-J 
4-J CP 03 xJ 03 
CP 4J CP CP u 
xJ CP 05 CJ O 

o cp u or- 4J 

CP O U 4-» o 



u 



4-J ro 03 

U 03 U 

03 03 03 

03 03 03 

03 CP CP 

CP U 03 

xJ 03 03 

4-J xJ CP 

05 XJ 03 

CJ U 03 

03 03 CJ 

03 U 4J 

4-» 03 4J 

03 XJ u 

03 03 03 

CP 03 CP 

4J U XJ 

U CP U 

O XJ CP 

XJ XJ CP 

xJ CJ 4-J 

03 4-> U 

CJ CP CP 

a cp 03 

CP 03 03 
XJ 03 XJ 
CJ CJ 03 
CJ U CP 
03 CP CP 
4-> 03 03 
CP 03 U 
03 CP 4-J 
03 CP4-> 
03 4-J 03 
03 XJ CJ 
CP 4-J U 
4-J 03 03 
03 'J CP 
03 XJ 03 
CP 4-J r0 
03 03 U 

CP O 4-J 

03 XJ CJ 
CP CP U 
CP U 03 
03 4-J 4-J 
05 XJ CJ 

U CP 03 
CP XJ 03 
03 U CP 
03 xJ XJ 
XJ 03 XJ 
03 CP 03 
CP O 03 
4-J 4-J CP 
03 03 CP 
03 CP 03 
U 03 03 
4-J 03 U 
03 XJ XJ 
CJ 03 03 
03 CP CP 
XJ CJ 03 
CP 03 03 
03 03 U 

03 XJ XJ 
CP 4-J CP 
03 CP CP 
03 XJ 03 
(J 'J CP 
XJ 03 CJ 
03 CP J-J 

o 03 cr 

O 03 O 
4_i (J 03 
XJ 03 03 
03 4-J xJ 
CP CP U 

CP 4-J XJ 
4-J 4-J CP 



CP 03 

XJ (J 

03 0? 

4-J 03 

CP U 
CP O 

U U 

U CP 

03 4-1 

4-J XJ 

'J 03 

4-> CJ 
CP 03 

4-> 03 

U CP 

CJ xj 

XJ xJ 

XJ CP 

CP 4-J 
03 03 
03 4-J 
4-J CP 

U CP 
03 CP 
CP 03 
4-» U 
4J CP 
CP 4-J 
4-J XJ 

o u 

xJ 03 

03 03 

03 4-J 

U CP 

03 CP 

03 4-J 

U 03 

03 CP 

U CP 

U 4-J 

03 -xj 

03 CP 

U XJ 

03 CJ 

XJ O 

XJ 03 

03 03 
CP CJ 

4-J U 

U XJ 

03 CP 

U XJ 

CJ CJ 

4-J U 

CP XJ 
xj CP 

XJ XJ 

xj CJ 
03 03 
CP CP 
XJ 03 
U 03 

CP 4-J 

U U 

4-J CP 

03 CP 

4-J XJ 
CP 4-J 

CP CJ 

4_" 03 
4-J 03 
CP CP 
O 03 
4-J CP 
XJ U 

O 4-1 

03 4-J 

03 4-1 

U iJ 

03 CP 



CP CJ 
4-J CP 
O O 
4-J 03 
CP CP 
XJ O 
CP CP 
03 XJ 
03 03 
4_> CP 
CP CP 

4-J XJ 

03 CP 
03 CP 
CP CJ 
03 03 
03 03 
CP U 
03 03 
CP 03 
03 CP 
CP 03 
03 CP 
CP U 
O 03 
03 03 
CP 4-J 
CP CP 

XJ XJ 

CJ CJ 

4-J O 

03 03 

U XJ 

03 O 
CP U 

03 U 

V 03 

XJ 03 

XJ 03 

4-J U 

CP cj 
XJ CJ 
03 OJ 
CP 03 
CP CP 
CJ 03 
U CP 

4-J 4-J 

U xj 

03 CP 

03 XJ 

03 CP 

03 4-J 

U CP 

03 03 

U 03 

U 4-J 

4-1 03 

CP CP 

4-J 4-1 
03 CP 
03 CP 

CJ 03 
CP 03 
03 CP 
03 CP 
CP 03 
03 'J 
03 CP 
XJ 03 

4-J 03 

CP O 
'J O 
CP XJ 
xj CJ 
0? 03 
03 03 
O CP 

U 4-1 



a 

XJ 

u 
m 
cn 

03 

cj 
u 

CP 

ro 

03 
XJ 

u 

03 

XJ 

CP 

XJ 
03 
03 
CP 

a 

CP 



n3 
CP 
03 
03 
CP 
03 

m 

03 
CP 

CP 03 
CJ 03 
03 4-J 
0J CJ 
4-> 03 
O 03 

4-> 4-» 
4-> U 

CP xj 
CP U 
4J U 

u xj 

4-J 4J 

u cj 

03 4-J 
CP 4-J 
03 CP 
03 4-J 
CP CJ 
03 4-J 
03 XJ 
CP CJ 
U xJ 
CJ 03 
03 03 
O CP 



CP 03 
03 CP 
03 U 

XJ XJ 
U 4-J 

CP C^ 
CJ xj 

03 4-J 
CT' 4_J 



CP 03 
CP 03 

u u 
u u 

xj CJ 

CJ 03 

03 03 

03 CP 

03 03 

03 CP 

U XJ 

03 4-J 

U CP 

CJ XJ 

xj CP 

CP 4-1 
4J CP 

03 03 

03 03 

CJ XJ 

CP 03 

03 CP 
03 

CP CP 

03 CP 

03 03 

4-J 03 

XJ CP 

CP CP 

O 03 

CP U 

4-J CP 

03 03 

03 03 

O CJ 

CJ 03 

03 4-J 

U U 

03 03 

03 03 

O CP 

U xj 



XJ U 



CJ CJ 
03 4-J 
CP U 
03 U 
03 xJ 
CP 4-J 
03 U 
03 4-J 
CP --J 
CJ CP 
U XJ 

03 a 

O 4-? 

CP xj 



03 O 

03 fC 



CP CJ» u 



xj U 

03 J-J 

U CP 

03 4-J 

03 CP 

CJ 03 

< 03 

O 4-» 

< CP 
CD 4-J 
<=C 03 
E-« 03 
O CP 
CD 03 

< 03 

< CP 
CD 03 

CP 

^ 0) 

< CP 
O OJ 

< CP 
CJ o 

^ 03 

< CP 
U CP 



m ' — » 
C>» 0 

4-> 4-J 
O XJ 

CP CP 

U 4-J 

03 J-J 
CP J-J 
U O 
CP U 
4-1 O 
03 r? 

CP 4-> 
CP 4-J 
4-J O 
CP 4-1 
CP O 

V 03 

03 CP 

03 ^ 

O CJ 

03 U 

T \- • 

CP r? 

03 0? 

CP 4-J 

M 'J. 

^ 03 <. 
03 4_> =- 



< O 

U 4-J 

c-» 03 

< CJ J-J CT *J? 

o or- cp 4-j cr 

E-» CP 4-> <T> 

C_> 4-J C 'J . v 

< 4-J XJ CP4-: 



O 

Q 



CJ CP 

J-A* < XJ C 



-J 

■r. x.- 



> 
u 

Q 
C_7 
CJ 

O 

u 

CJ • 

a: z 

CJ CO 
Q CO 
CO 

rn o 
cc Cu 

cj a 

O Q 
CO C14 

CU >4 

(J CO 
O Q 
jC CU 

jC ^ 

> H 

CJ o 
O CJ 

= u 

O E-» 

< H-f 
HH ^ 

= LD 

'J? 01 
U O 
v-l 0*2 



= CJ 
— CJ 

: r: ^ 



o 

CP 
CP 

4-J 

u 

4-J 

u 
u 

CP 
03 
03 
CP 



a 

03 
4-J 
CP 



CP 
CP 
03 
CP 
O 
03 
4-J 
03 
03 
CP 
U 
OJ 
03 
cn 

03 

cn 

4-J 
4-J 

03 
CP 
03 
03 
4-J 
CJ 
4-J 
CP 



XJ 

03 
CJ 

m 

03 

4-J 
03 
CP 
U 



XJ 

CP 
XJ 
U 
4-J 
03 
O 
CJ 



XJ CJ 
U 05 
03 03 
03 4-J 
CP O 
XJ xJ 
XJ XJ 

xj CJ 

XJ 4-J 
03 4J 
O U 

03 03 
CP Cn 
CP 4-J 
03 4-J 
03 4-J 

(J 03 

4-J CP 

CP CP 

U 03 

03 03 

03 (J 

4-> 03 

XJ XJ 
CP 03 

CJ CJ 

03 O 

03 4-J 

U 4-J 

4-J 03 

4-J CP 

4-J 4-J 

CJ 4-J 

03 U 
CP 03 
xj CP 

03 4-J 

CJ (J 

4-J 4-J 

CP CP 

a 03 

05 CP 
03 CP 
CJ 4J 
03 U 
03 XJ 
03 4-J 
03 CP 
CP CJ 
CP 03 
4-J 03 
CJ CP 
03 03 
03 03 
CJ CJ 
CP 4-J 
03 4-J 
03 CJ 
CP 03 
03 03 
03 03 
CP 03 
XJ CP 
CJ 03 
CJ CP 
U 03 
4-J CP 
03 03 
CP 03 
03 CJ 
CJ 03 
03 03 
xj O 

O 4-J 
XJ 4-J 

CP 03 
4-J CP 

CP 03 
CP O 
U U 



03 4-J 

CP CJ 

CP 03 

05 CP 

03 4-J 

4-J CJ 

4-1 4-J 

05 CP 

CJ 03 

03 05 

4-1 CJ 

fj 03 

03 4J 

03 CP 
CJ 03 
03 03 
4-J CP 
4-J 03 
CJ CP 
4-J 4-J 
CJ CJ 
CJ 4-J 
4-J CP 
CP 4-J 
4-1 4-J 
XJ CJ 
CJ 4-J 
4-J 03 
4-J CP 
CP 03 
03 03 
03 U 
CJ 03 
03 4-J 
CP CJ 
03 03 
CP J-J 
03 xj 
CP CP 
03 CP 
03 CP 

CP 4-J 

03 CJ 
CP CJ 
CP 4-J 
03 CP 
03 U 
O 0J 
03 CP 
03 O 
CP 03 
4-J 05 
CJ 4-J 
O CJ 

4-J CP 

4-J CJ 
CP 4-J 
03 4-J 
03 CJ 
CJ 03 
03 03 

4-J U 
03 4-J 
CJ 03 
CJ 4-J 
CJ 03 
03 CP 
CP 4-J 
CP CJ 
03 03 
03 4-> 

XJ 03 

xj cn 

CP 4-J 
4-J XJ 
xJ 03 
CP U 
O CJ 
03 4-J 
4-J CJ 



4-J 03 03 

O 03 CJ 

05 03 03 

03 03 03 

03 CP CP 

a- u o3 

xj 03 03 

J-J j-J CP 

03 4-J 03 

U CJ 03 

03 03 U 

03 CJ xj 

4-J 03 xJ 

05 4-J XJ 

03 03 03 

CT* 03 CP 

4-J CJ 4-J 

O CP U 

CJ xj Cn 

xj xj Cn 

xj CJ XJ 

05 4-J CJ 

CJ CP CP 

CJ CP 05 

CP 05 03 

XJ 03 XJ 

O CJ 03 

CJ CJ CP 

03 CP CP 

XJ 05 03 

cn o5 CJ 

03 CP XJ 

03 CP 4-J 

03 4-J 03 

05 4-J CJ 

CP 4-J O 

XJ 03 03 



CP 
ro 
rc 
U 
xj 
u 



03 O 
05 

CP 4-J 

05 05 
CP CJ 
03 

CP CP U 
CP CJ 03 

03 4-J XJ 

03 xJ CJ 
CJ CP 03 
CP XJ 03 
03 CJ cn 

03 4-J XJ 

4-J 03 XJ 

03 CP 03 

CP CJ 03 

XJ xJ CP 

03 03 CT* 

03 CP 03 

CJ 03 03 

XJ 03 CJ 

05 xj xJ 

CJ 03 03 

03 CP CP 

XJ U 03 

CP 03 03 

03 03 U 

05 XJ XJ 
CPXJ CP 

03 CP CP 

05 xJ 03 

CJ CJ CT 

4-J 03 O 

03 CP XJ 

CJ a? ~ 

O 03 
XJ 03 03 
03 XJ xj 
CP CP 'J 
CP xJ x- 
XJ XJ CP 



CJ CP 

CJ XJ 

XJ 4-J 

xj CP 

Cn xj 

o3 03 

03 xj 

4— J CP 

U CP 

o3 cn 

CP o3 

xj U 

xj CP 

CP xj 

XJ XJ 

CJ U 

XJ rr, 
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VARIANT OF THE PLASMODIUM MEROZOITE SURFACE PROTEIN (MSP-1) AND 
VACCINE COMPRISING SAID VARIANT 

Field of the Invention 

5 The present invention relates to modified Plasmodium MSP- 1 protein variants and their 
use in producing a vaccine against malaria. It also relates to a method for the rational 
design of suitable variants. 

Background to the Invention 

10 

Malaria is a devastating disease that causes widespread morbidity and mortality in areas 
where it is transmitted by anopheline mosquitoes. In areas of high transmission young 
children and non-immune visitors are most at risk from this disease, which is caused by 
protozoa of the genus Plasmodium. In areas of lower or unstable transmission, epidemics 
15 of the disease can result and afflict individuals of all ages. The most dangerous form of 
malaria, responsible for much of the morbidity and most of the mortality, is caused by the 
species Plasmodium falciparum. It has been estimated that 2 billion people are at risk 
from malaria, with 200-300 million clinical cases and 1-2 million deaths each year. 

20 The parasite has a complex life cycle in its human and mosquito hosts. In humans the 
stage of the life cycle which is responsible for the clinical symptoms of the disease occurs 
in the bloodstream. During this phase the parasite is largely hidden within host red blood 
cells. Here the parasite grows and multiplies. For example, within a red blood cell each 
P. falciparum parasite divides several times to produce approximately 20 new ones during 

25 a 48 hour cycle. At this point the red blood cell is burst open and the parasites (called 
merozoites at this stage) are released into the bloodstream. The merozoites must enter new 
red blood cells in order to survive and for the cycle of replication in the blood to continue. 
If the parasites do not manage to enter red blood cells they cannot survive for very long 
and are rapidly destroyed. Symptoms of malaria such as fever are associated with this 

30 cyclic merozoite release and re-invasion of red blood cells. 
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There is an urgent need for a vaccine against malaria. There is no effective vaccine 
currently available. In addition, mosquito control by the spraying of residual insecticides 
is either becoming ineffective or considered to be unacceptable, and there is a very 
worrying spread of drug resistance within parasites. The rapid spread of drug resistance is 
5 worrying because compounds such as the cheap and once-effective chloroquine are no 
longer useful in many parts of the world, and there are few if any new drugs available that 
are both cheap and effective. Vaccines against microorganisms can be very cost effective 
and efficient ways to protect populations against infectious diseases. 



10 Because of the complexity of the parasite's life cycle there are a number of points in its 
development within humans that could be the target of a protective immune response. It 
is known that with increasing age and exposure individuals do become immune to 
malaria, suggesting that protective responses do develop with time. Broadly speaking 
there are three types of vaccine strategy: to target the pre-erythrocytic stages, the asexual 

1 5 blood stage and the sexual stage. The pre-erythrocytic stages are the sporozoites that are 
injected by an infected mosquito when it takes a blood meal and the initial development of 
the parasite in the liver. The asexual blood stage is the infection and release of merozoites 
from red blood cells that occurs in a cyclic manner, and the stage responsible for the 
manifestation of the clinical symptoms. The sexual stage takes place in the mosquito's 

20 gut after it has ingested gametocytes in a blood meal and this initiates the infection of the 
insect to complete the cycle; a vaccine against the sexual stages would not protect the 
individual but could reduce transmission and therefore the incidence of malaria in a given 
human population. 

25 During the asexual cycle in the blood the parasite is directly exposed to the host's immune 
system, and in particular to antibodies circulating within the bloodstream, only transiently: 
when merozoites are released by rupture of one cell and before they penetrate another. If 
there are specific antibodies that can bind to the surface of the parasite then it is possible 
that these antibodies will interfere with the ability of the parasite to invade a new red 

30 blood cell. In fact it has been shown that several monoclonal antibodies that recognise 
single epitopes on parasite surface proteins, are capable of neutralising the parasite and 
preventing the cycle of reproduction within red blood cells. 
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One of the best characterised proteins on the surface of the merozoite is called merozoite 
surface protein 1 (MSP-1). MSP-1 is a large protein that varies in size and amino acid 
sequence in different parasite lines. It is synthesised as a precursor molecule of -200 kDa 
5 by the intracellular parasite and located on the parasite's surface. During release of 
merozoites from red blood cells and the re-invasion of new erythrocytes the protein 
undergoes at least two proteolytic modifications. In the first modification as a result of a 
process called primary processing, the precursor is cleaved to four fragments of -83, 30, 
38 and 42 kDa that remain together as a complex on the merozoite surface. This complex 

10 also contains two other proteins of 22 kDa and 36 kDa derived from different genes. The 
complex is maintained by non-covalent interactions between the different subunits and is 
held on the merozoite surface by a glycosyl phosphatidyl inositol anchor, attached to the 
C-terminus of the 42 kDa fragment and inserted into the plasma membrane of the 
merozoite. At the time of merozoite invasion of an erythrocyte the C-terminal 42 kDa 

15 fragment is cleaved by a second proteolytic cleavage in a process called secondary 
processing. The result of secondary processing is that the entire complex is shed from the 
surface of the merozoite except for a C-terminal sub-fragment that consists of just under 
one hundred amino acids and which is carried into the newly invaded erythrocyte on the 
surface of the merozoite. 

20 

Based on sequence similarities, the structure of this small C-terminal fragment (called 
MSP-1 19) was suggested to consist of two epidermal growth factor (EGF)-like domains 
(see sequence in Figure 1) (Blackman et al y 1991). An EGF-like motif consists of a 45-50 
amino acid sequence with a characteristic disulphide bonding pattern and such domains 

25 occur frequently in extracellular modular proteins of animals. In the MSP-1 C-terminal 
fragment each of the motifs contains six Cys residues proposed to form three disulphide 
bonds and each motif has a partial match to the EGF consensus (see Figure 1). However, 
because the degree of similarity is limited and since the pattern of its disulphide bonding 
is not known, the designation of the MSP-1 C-terminal fragment as comprised of EGF- 

30 like structures has been regarded as tentative. Other relatively divergent potential EGF- 
like sequences occur in Plasmodium proteins, but previous structure determinations have 
been confined to those from metazoan organisms (Campbell et aL, 1998). 
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A number of studies have implicated MSP-1 as the target of a protective immune 
response. Although the goal of this work is to develop a malaria vaccine for use in 
humans, out of necessity most of this experimental work has been done either in model 

5 animal systems or in vitro. These include studies of the effect of specific antibodies on 
parasite invasion of erythrocytes in vitro, passive immunisation studies in rodent malaria 
models in laboratory mice and direct immunisation in both rodent and primate malaria 
models using either native protein (derived from the parasite) or recombinant protein 
expressed from parts of the MSP-1 gene in heterologous organisms. Sero-epidemiological 

10 studies have also showed a correlation between human antibody responses to parts of the 
MSP-1 molecule and protection against clinical disease. Much, but not all, of the work 
has focused on the immune response to the C-terminal MSP-1 19. For example some 
monoclonal antibodies that recognise MSP-1 19 prevent red blood cell invasion in in vitro 
cultures (Blackman et al. 9 1 990). Interestingly, these antibodies that inhibit invasion also 

15 inhibit the secondary processing of the 42 kDa fragment, suggesting the mechanism by 
which they work is by steric hinderance of the protease responsible for secondary 
processing (Blackman et al. 9 1994). Since secondary processing goes to completion 
during successful invasion, if it cannot occur then invasion is interrupted. 

20 All of the work described above would suggest'that MSP-1 and in particular polypeptides 
based on the C-terminal sequence that forms the 42 kDa or the MSP-1 19 region, should be 
very good candidates for malaria vaccine development. However, several studies have 
shown that the epitopes or binding sites for antibodies on MSP-1 19 require a correct 
polypeptide tertiary structure, and that this is destroyed by treatments that reduce the 

25 disulphide bonds that are postulated to be present between the cysteine residues present in 
MSP-1 19. This limitation appears to have been overcome by the expression of 
recombinant protein in ways that allow antibodies that recognise the native parasite MSP- 
1 to bind. Other investigators have suggested that other parts of MSP-1 also have 
potential for inclusion in a vaccine, however the MSP-1 C-terminal fragment is currently 

30 the lead candidate for development of a vaccine against the blood stages of the malaria 
parasite (Diggs et al. 9 1993; Stoute et a/., 1998). 
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As stated above, every -48 hours P. falciparum merozoites are released from the infected 
erythrocyte to re-invade new red blood cells and during this time they are exposed to the 
host's immune system. Therefore, the question arises as to how the parasite has evolved to 
avoid the potentially lethal effects of, for example, neutralising antibodies. In other 
5 infectious micro-organisms it is clear that there is a constant battle between the immune 
system and the micro-organism, and that sophisticated mechanisms have been evolved by 
micro-organisms to evade the immune response. For example antigenic variation and 
antigenic diversity are two mechanisms that involve presenting the immune system with 
"a moving target" such that even though an immune response to one variant of the micro- 

10 organism may kill that variant, new variants are produced that are at least partially or fully 
resistant to the immune response. In the case of malaria merozoites and in particular 
MSP-1 an alternative mechanism has been proposed whereby the binding of some 
antibodies ("blocking antibodies") can prevent the binding of neutralising antibodies and 
thereby allow the parasite to successfully invade a red blood cell even in the presence of 

15 neutralising antibodies (Guevara Patino et aL, 1997). These blocking antibodies may be 
of two types, those against epitopes that are formed from amino acids that are distant in 
the linear primary sequence from the epitopes that are the target of neutralising antibodies, 
and those that are against epitopes that overlap with the epitopes of the neutralising 
antibodies. This represents a novel mechanism by which a parasite can evade an effective 

20 immune response, and unlike mechanisms based oh antigenic polymorphism or diversity, 
it is not dependent upon amino acid sequence diversity. 

Some monoclonal antibodies (mAbs) that bind to MSP-1 19 inhibit the proteolytic cleavage 
and erythrocyte invasion, suggesting that cleavage is a prerequisite for invasion 

25 (Blackman et aL, 1994). Other mAbs that bind to the MSP-1 C-terminal fragment do not 
inhibit processing or invasion but block the binding of the inhibitory neutralizing 
antibodies. Other antibodies that bind to MSP-1 19 neither inhibit nor block the binding of 
inhibitory antibodies. In the presence of blocking antibodies, inhibitory antibodies are 
ineffective and invasion proceeds. The balance between inhibitory and blocking 

30 antibodies induced by immunisation may be a critical factor in determining whether or not 
the immune response is effective in preventing invasion (Guevara Patino et aL, 1997). 
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An object of the present invention is therefore to provide an effective vaccine against the 
malaria parasite based on variants of the Plasmodium MSP-1 protein. In designing such a 
5 vaccine, the following criteria should be met: 

I. The amino acid sequence of the polypeptide to be used in the vaccine should 
contain epitopes that are the targets of, and can induce, neutralising antibodies. 

10 2. The polypeptide should ideally not include amino acid sequences that only form 
epitopes for blocking antibodies. 

3. If the polypeptide contains epitopes for both neutralising and blocking antibodies 
then it should be modified to remove the blocking antibody epitopes without affecting the 
1 5 neutralising epitopes. 

To assist in the design of candidate vaccine polypeptides fulfilling these three criteria, it is 
important to determine the three-dimensional structure of the MSP-1 C-terminal fragment 
since this will help in mapping sites of antibody interactions with this fragment. We have 
20 therefore determined the solution structure of the MSP-1 C-terminal, including the pattern 
of disulphide bonding, using NMR techniques. 

We have made amino acid substitutions in the sequence of MSP-1 19 that prevent the 
binding of individual blocking monoclonal antibodies, without affecting the binding of 

25 neutralising antibodies. By determining the 3-dimensional structure of MSP-1 19 we have 
identified where these antibody binding sites are located in the tertiary structure and this 
has allowed other amino acid substitutions to be made that have similar properties. We 
have shown that several substitutions, each affecting the binding of one or more blocking 
antibodies can be combined into a single molecule, and that these modified molecules 

30 continue to bind the neutralising antibodies but fail to bind any of the blocking antibodies. 
Such modified molecules are expected to be much more effective than the natural or wild- 
type protein structure at inducing a protective neutralising antibody response when used to 
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immunise individuals as a malaria vaccine. In addition we have made other modifications 
in the primary structure of the molecule which do not affect the binding of the neutralising 
antibodies but which may contribute to increased immunogenicity of the molecule. The 
modified MSP-1 | 9 structures, either alone or coupled to other carriers, which may or may 
5 not contain other parts of MSP-1 to enhance the immunogenicity (for example a 
combination of the remainder of the MSP-I42 kDa fragment with the modified MSP-1 19) 
and provide additional T cell epitopes, would be more effective vaccines than equivalent 
structures that have not been modified in this way. 

10 Accordingly, the present invention provides a non-naturally occurring variant of a 
C-terminal fragment of a Plasmodium merozoite surface protein- 1 (MSP-1) wherein said 
variant has (i) a reduced affinity, compared with a naturally occurring Plasmodium 
MSP-1 19, for at least one first antibody capable of blocking the binding of a second 
antibody, which second antibody inhibits the proteolytic cleavage of Plasmodium 

15 MSP-I42 and (ii) substantially the same affinity for said second antibody compared with 
said naturally occurring Plasmodium MSP- 1 19. 

Preferably, the Plasmodium MSP-1 19 and MSP-I42 are Plasmodium falciparum MSP-1 19 
and MSP-1 42. 

20 

The first antibody is preferably selected from mAbs IE1, 2.2, 7.5, 9C8 and 111.4. The 
second antibody is preferably selected from mAbs 12.8, 12.10 and 5B1. 

The present invention further provides a non-naturally occurring variant of a C-terminal 
25 fragment of a Plasmodium merozoite surface protein- 1 (MSP-1) comprising an amino acid 
modification at any one of amino acid residues 14, 15, 27, 31, 34 , 43 48 and 53 of the 
Plasmodium falciparum MSP-1 19 amino acid sequence shown as SEQ I.D. No. 1 or their 
equivalent positions in other Plasmodium MSP-1 19 polypeptides. 

30 Preferably said modifications are substitutions selected from Gin 14— >Arg, Glnl4-»Gly, 
Asnl 5->Arg, Glu27-*Tyr, Leu3 1 ->Arg, Tyr34-*Ser, Tyr34->Ile, Glu43-*Leu, 
Thr48-»Lys and Asn53-»Arg and their equivalents in other Plasmodium MSP-1 19 
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polypeptides. More preferably said substitutions are combinations of substitutions 
selected from [Glu27->Tyr, Leu31->Arg and Glu43->Leu], [Glu27->Tyr, Leu31->Arg, 
Tyr34->Ser and Glu43-»Leu], [Asnl5->Arg, Glu27->Tyr, Leu3l->Arg and 
Glu43->Leu] and their equivalents in other Plasmodium MSP-1 19 polypeptides. 

5 

In a preferred embodiment, a variant MSP-1 polypeptide of the invention further 
comprises a mutation at Cysl2 and/or Cys28 of the Plasmodium falciparum MSP-1 19 
amino acid sequence shown as SEQ I.D. No. 1. Preferably such modifications are 
substitutions selected from Cysl2— ►He and Cys28— >Trp, and Cysl2— >Ala and 
10 Cys28-^Phe. 

Most preferably the substitutions are combinations selected from [Cysl2— ►lie, Asn 
15-^Arg, Glu27— Tyr, Cys28^Trp, Leu31-+Arg, Glu43-*Leu], [Cysl2-+Ile, Asn 
15-*Arg, Glu27->Tyr, Cys28-*Trp, Leu31-+Arg, Glu43— >Leu, Asn53-»Arg], 
15 [Cysl2^Ile, Asn 15-+Arg, Glu27^Tyr, Cys28-^Trp, Leu31->Arg, Tyr34-+Ser, 
Glu43— *Leu, Asn53— >Arg] and their equivalents in other PlasmodiumMS?-\\9 
polypeptides. 

The present invention also provides a method for producing a Plasmodium MSP-1 variant 
20 for use in preparing a vaccine composition which method comprises modifying one or 
more amino acid residues of a Plasmodium MSP-1 C-terminal fragment such that the 
resulting derivative has (i) a reduced affinity, compared with a naturally occurring 
Plasmodium MSP-1 19, for at least one first antibody capable of blocking the binding of a 
second antibody, which second antibody inhibits the proteolytic cleavage of Plasmodium 
25 MSP-1 42 and (ii) substantially the same affinity for said second antibody compared with 
said naturally occurring Plasmodium MSP-1 19. In particular the method of the invention 
preferably comprises as a preliminary step, selecting a candidate amino acid residue by 
reference to a three dimensional NMR model structure, preferably as set out in Table 2. 
More specifically, the 3D model structure is used to select a surface exposed amino acid 
30 residue. Advantageously, a further step is included of computer modelling the three 
dimensional structure of the variant to exclude polypeptides that do not fold correctly. 
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The present invention also provides a non-naturally occurring Plasmodium MSP-1 variant 
obtained by the method of the invention. 

In a further aspect, the present invention provides a polynucleotide encoding a variant of 
5 the invention operably linked to a regulatory sequence capable of directing the expression 
of said nucleotide in a host cell. The polynucleotide may comprise a sequence which has 
been optimised for expression in the host cell. The host cell may be a Pichia pastoris cell. 
Also provided is a nucleic acid vector comprising a polynucleotide of the invention, 
including viral vectors, and a host cell comprising a nucleotide or vector of the invention. 

0 

In another aspect, the present invention provides a pharmaceutical composition 
comprising a variant of the invention, a polynucleotide of the invention or a vector of the 
invention together with a pharmaceutical^ acceptable carrier or diluent. 

Preferably, the composition further comprises an immunogenic Plasmodium polypeptide 
or fragment or derivative thereof such as MSP- 1 33 or a fragment or derivative thereof 
which may be covalently attached to the non-naturally occuring MSP-1 19. It is preferred 
not to use wild-type MSP-1 19 sequences. The further immunogenic peptide may itself be 
derivatised in an analogous manner as described above for MSP-1 19. Thus, epitopes 
present in the peptide may be identified and modified to prevent binding of blocking 
antibodies, without affecting the binding of neutralising antibodies. These epitopes may be 
capable of binding to antibodies which have similar properties to the first antibody 
described above, for example, in binding affinity. The further immunogenic peptide may 
comprise several such modifications in its amino acid sequence. 

The present invention also provides a method for producing anti-MSP-1 antibodies which 
method comprises administering a polypeptide variant of the invention, or a 
polynucleotide of the invention or a vector of the invention to a mammal, typically a non- 
human mammal. 

In a preferred embodiment, the present invention provides a method for producing 
polyclonal anti-MSP-1 antibodies which method comprises administering a polypeptide 



20 



25 
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variant of the invention, or a polynucleotide of the invention or a vector of the invention to 
a mammal, typically a non-human mammal, and extracting the serum from said mammal. 
Also provided is an antibody produced by the said methods. 

5 The polypeptides, nucleotides and vectors of the present invention may be used in 
methods of treating and/or preventing malaria caused by Plasmodium species, in particular 
Plasmodium falciparum. Accordingly, the present invention provides a method of 
inducing immunity against malaria induced by Plasmodium falciparum which comprises 
administering to a person in need of such immunity an effective amount of a variant, a 
10 polynucleotide or a vector of the invention. 

Also provided is a method of immunizing a mammal, said method comprising 
administering an effective amount of a variant, a polynucleotide or a vector of the 
invention. In particular, said mammal is immunized against malaria. Preferably the 
1 5 mammal is a human. 

The present invention also provides a method of treating a malaria infection in a human 
patient which comprises administering to the patient an effective amount of the 
pharmaceutical composition of the invention. 

20 

We further provide according to the present invention a nucleic acid encoding a 
Plasmodium MSP-1 polypeptide, in which the nucleic acid is optimised for expression in 
a heterologous host cell.. Preferably, the heterologous host is a Pischia pastoris cell. The 
MSP-1 polypeptide may be selected from the group comprising an MSP-1 42 polypeptide 

25 comprising a sequence shown in Figures 2C and 2E, an MSP-1 19 polypeptide comprising 
a sequence shown in Figure 2C, and an MSP-1 33 polypeptide comprising a sequence 
shown in Figure 2E. The optimised nucleic acid may comprise a sequence selected from 
the sequences of Figure 2A, Figure 2B and Figure 2D. We further provide a vector 
comprising such a nucleic acid, a host cell comprising such a vector, and a pharmaceutical 

30 composition comprising such a nucleic acid or a vector, together with a pharmaceutically 
acceptable carrier or diluent. The pharmaceutical composition may further comprise an 
immunogenic Plasmodium polypeptide or fragment or derivative thereof. 
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Detailed description of the invention 

Although in general the techniques mentioned herein are well known in the art, reference 
5 may be made in particular to Sambrook et al., Molecular Cloning, A Laboratory Manual 
(1989) and Ausubel et al. y Current Protocols in Molecular Biology (1995), John Wiley & 
Sons, Inc. 

A. MSP-1 variant polypeptides 

10 

The variant MSP-1 polypeptides of the present invention will be described with reference 
to Plasmodium falciparum MSP-1 amino acid sequences. However, it should be 
appreciated that except where otherwise stated, all references to MSP-1 polypeptides 
include homologues of MSP-1 found in other Plasmodium species, such as P. v/vox, 
15 P. malariae and P. ovale which all infect humans and P. yoelii which infects mice. 

The variant MSP-1 polypeptides of the present invention are based on C-terminal 
fragments of the Plasmodium falciparum MSP-1 42 polypeptide shown as SEQ I.D. Nos. 2 
or 3. Such polypeptides will comprise some or all of the MSP-1 19 region (SEQ LD. No. 
20 1), preferably at least substantially all of the domain 1 and/or domain 2 EGF-like 
sequences found in MSP-1 19 (approximately amino acids 1-47 and amino acids 48-96, 
respectively, of SEQ I.D. No. 1). It is particularly preferred to use regions that are 
conserved in most, more preferably all parasites of a single species to increase the 
effectiveness of the variant as a vaccine against a wide range of strains. 

25 

Variant MSP-1 polypeptides of the present invention comprise modifications to their 
primary amino acid sequence that reduce the ability of blocking antibodies to bind to the 
MSP-1 polypeptides. In addition, any modifications made should maintain epitopes 
recognised by neutralising antibodies such that the affinity of the neutralising antibodies 
30 for the MSP-1 variant is substantially the same as for naturally-occurring MSP-1 
polypeptides (such as an MSP-I42 polypeptide having the sequence shown in SEQ I.D. 
Nos. 2 or 3). Some reduction in the binding of some neutralising antibodies may be 
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tolerated since the primary objective is to inhibit the binding of blocking antibodies and it 
is likely that an effective reduction in the binding of blocking antibodies will compensate 
in terms of overall vaccine efficacy for a small reduction in neutralising antibody binding. 

5 Neutralising antibodies in the context of the present invention are antibodies that inhibit 
malaria parasite replication. A variety of neutralising antibodies, polyclonal and 
monoclonal, are known in the art, including mAbs 12.8, 12.10 and 5B1 referred to in the 
Examples. The activity of neutralising antibodies can be determined in a variety of ways 
that have been described in the art. For example, a convenient assay method described in 
10 Blackman et al. y 1994 involves using preparations of merozoites (Blackman et aL, 1993; 
Mrema et aL, 1982) to measure cleavage of MSP-I42 into MSP-I33 and MSP-1 19. Briefly, 
freshly isolated merozoites are washed in ice-cold buffer and divided up into aliquots of 
about 2xl0 9 merozoites. A test antibody is added to each aliquot and the sample 
incubated at 37°C for 1 hour. The samples are then subjected to SDS-PAGE under non- 
15 reducing conditions on a 12.5% polyacrylamide gel, Western blotted and the blot probed 
with antiserum to MSP-1 33. In the control sample, two main bands are seen - one 
corresponding to MSP-1 42 and one lower molecular weight band corresponding to 
MSP-1 33. Neutralising antibodies will reduce the amount of the lower molecular weight 
band as a result of inhibiting secondary proteolytic processing of MSP-1 42. 

20 

This method is a particularly preferred method for assessing the efficacy of neutralising 
antibodies in the presence of antibodies believed to act as blocking antibodies. Where 
candidate competing blocking antibodies are to be tested, the merozoite sample is 
preincubated with a blocking antibody for 15 mins on ice prior to incubation with a 
25 neutralising antibody at 37°C for 1 hour as described above. Thus blocking antibodies can 
readily be identified and/or characterised using such an assay method. 

Other assay methods include merozoite invasion inhibition tests as described in Blackman 
etal., 1990. 

30 

As discussed above, blocking antibodies are defined in the context of the present 
invention as antibodies that inhibit the binding of neutralising antibodies to MSP-1 but 
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which do not themselves inhibit invasion of red blood cells by malaria parasites. Thus 
they "block" the neutralising function of the neutralising antibodies. A variety of blocking 
antibodies have been characterised in the art, including mAbs IE1, 2.2, 7.5 and 111.4 
referred to in the Examples. As discussed above, blocking antibodies can conveniently be 
5 identified and/or characterised using assays that test their effect on neutralising antibody 
function. 

Modifications that may be made to produce MSP-1 variants of the invention include 
substitutions, deletions and insertions. It is particularly preferred to use substitutions to 

10 minimise disruption of the secondary/tertiary structure of the polypeptide. Furthermore, 
particularly preferred substitutions are those that replace one class of amino acid with 
another class, such as an aliphatic non-polar residue with a charged polar residue. For 
example, the twenty naturally occurring amino acids may be divided into four main 
groups (aliphatic non-polar [G, A, P, I, L and V], polar un-charged [C, S, T, M, N and Q], 

15 polar charged [D, E, K and R] and aromatic [H, F, W and Y]) and it is preferred to replace 
an amino acid from one group with an amino acid from another group. 

Other possibilities include replacing a positively charged side chain with a negatively 
charged side chain, replacing an amino acid with a large side chain with an amino acid 
20 with a smaller or no side chain (glycine), replacing a polar amino acid with a charged 
polar amino acid, replacing a large aromatic amino acid with an amino acid with a small 
side chain, and replacing cysteine residues that are involved in disulphide bonds. 

Particularly preferred modifications are an amino acid modification at any one of amino 
25 acid residues 14, 15, 27, 31, 34 , 43, 48 and 53 of the Plasmodium falciparum MSP-1 19 
amino acid sequence shown as SEQ I.D. No. 1 or their equivalent positions in other 
Plasmodium MSP-1 19 polypeptides. These residues are all almost within the EGF-like 
domain 1. It is known that the epitopes of some antibodies contain amino acid sequences 
that are within EGF-like domain 2, therefore equivalent modifications may also be made 
30 in EGF-like domain 2. Preferred examples of modifications include the following 
substitutions Glnl4->Arg, Glnl4->Gly, Asnl5->Arg, Glu27->Tyr, Leu31-*Arg, 
Tyr34-»Ser, Tyr34->Ile, Glu43->Leu, Thr48->Lys and/or Asn53->Arg and their 
equivalents in other Plasmodium MSP-1 19 polypeptides. 
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It is especially preferred to carry out more than one modification, i.e. to use combinations 
of modifications, such as two or more or three or more. In a preferred embodiment, an 
MSP- 1 variant of the invention comprises a combination of amino acid substitutions 
5 selected from [Glu27->Tyr, Leu31-»Arg and Glu43-»Leu], [Glu27-»Tyr, Leu31->Arg, 
Tyr34->Ser and Glu43-»Leu], [Asnl5->Arg, Glu27-*Tyr, Leu31->Arg and 
Glu43-»Leu] and their equivalents in other Plasmodium MSP-1 19 polypeptides. 

A particularly preferred combination further comprises a modification to Cysl2 and/or 
10 Cys28 (and/or their equivalent residues in EGF-like domain 2) to disrupt the disulphide 
bond. Preferably such modifications are substitutions selected from Cysl2— >Ile and 
Cys28— >Trp, and Cysl2— ►Ala and Cys28— Phe. 

Most preferably the substitutions are combinations selected from [Cysl2— >Ile, Asn 
15 15—Arg, GIu27^Tyr, Cys28-+Trp, Leu31^Arg, Glu43— Leu], [Cysl2— He, Asn 
15->Arg, Glu27— >Tyr, Cys28-+Trp, Leu31— ►Arg, Glu43— >Leu, Asn53-*Arg], 
[Cysl2->Ile, Asn 15-+ Arg, Glu27->Tyr, Cys28->Trp, Leu31-+Arg, Tyr34— ►Ser, 
Glu43— »Leu, Asn53— >Arg] and their equivalents in other Plasmodium MSP-1 19 
polypeptides. 

20 

Substitutions are not confined to using naturally occurring amino acids - non-naturally 
occurring amino acid analogues may also be used, in particular where solid phase 
synthesis is to be used to chemically synthesise the variant, as opposed to recombinant 
technology. 

25 

Modifications to MSP-1 amino acid sequences may be carried out using standard 
techniques such as site-directed mutagenesis using the polymerase chain reaction. 
Alternatively, variants may be obtained by solid phase synthetic techniques. 

30 To determine whether a variant MSP-1 polypeptide produced by modification of its 
primary amino acid sequence complies with the criteria specified above, the affinity of at 
least one neutralising antibody and at least one blocking antibody for the variant 
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polypeptide compared with the naturally occurring MSP-1 sequence may be tested. 
Ideally more than one of each type of antibody should be used, for example two or three. 

The ability of antibodies to bind to the variant and wild-type polypeptides may be 
5 determined using any one of a variety of methods available in the art for determining 
antibody-epitope binding. One such method, described in the Examples, involves the use 
of MSP-1 sequences expressed as fusion proteins with a protein tag such as glutathione-S- 
transferase (GST). These GST-fusion proteins are typically immobilised to a solid phase 
such as glutathione sepharose beads or a BIAcore sensor chip. Binding of antibodies, 
10 such as monoclonal antibodies, to the fusion proteins may be determined using standard 
techniques such as Western blotting and/or by labelling the antibodies with a radioactive 
label such as l25 I. The use of BIAcore technology allows easy quantitation of the results. 



Preferably, the reduction in binding of at least one of the blocking antibodies tested is at 
15 least 50% compared to wild-type MSP-1, more preferably at least 75, 80 or 90%, typically 
as assessed using recombinantly expressed MSP-1 immobilised to a BIAcore sensor chip. 
By contrast, the binding of at least one, for example at least two or three, of the 
neutralising antibodies tested, more preferably at least half of the neutralising antibodies 
tested, more preferably substantially all of the neutralising antibodies tested is reduced by 
20 less than 50%, more preferably less than 25%. The number of neutralising antibodies that 
need be tested to confirm compliance with the test criteria will not typically exceed from 
three to five different antibodies (three antibodies are used in the Examples). In a 
particularly preferred embodiment the binding of at least one neutralising antibody is 
increased by at least 10%. 

25 

The results given in Table 2 in the Examples provide partial guidance to the skilled person 
as to which residues may be modified to produce a variant MSP-1 of the invention. 
However, the provision herein for the first time of the three dimensional solution structure 
of MSP-1 J9 provides the skilled person with further detailed guidance as to which residues 
30 may be altered. In particular, epitopes are expected to be exposed to the aqueous 
environment on the exterior of the MSP-1 19 fragment. Consequently, the precise 
structural information provided which teaches the position of surface exposed amino acids 
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allows the skilled person to target those residues for modification. This data is given in 
Tables A/B and has also been submitted to the Protein Data Bank (PDB Accession no. 
1CEJ). It enables the skilled person to identify the precise location of individual amino 
acids in the three dimensional structure. Typically, the data is loaded into suitable 
5 software, well-known in the art such as Insight II, MOLSCRJPT GRAS P and RASMOL. 

Further, knowing the location of a modification in the 3 -dimensional structure which 
affects the binding of a blocking antibody without affecting the binding of the neutralising 
antibodies, it is possible to identify other residues that are on the surface and in the 

10 vicinity of the original modification and which may be easily modified to further improve 
the properties of a modified protein. These residues may be in either the first or the second 
EGF-like motifs or in the sequence between them. Since it is known that an antibody 
binding site can encompass a volume that corresponds approximately to the range of 5 to 
8 amino acids, it is clear that modifications of these adjacent residues may also affect the 

15 affinity of the protein for the blocking antibodies. Once an adjacent amino acid has been 
identified it can be modified according to the principles outlined above and the 
contribution of the modification to the overall antigenicity and immunogenicity of the 
protein, either alone or in combination with other modifications, can be assessed. Those 
changes that contribute to a reduced affinity for the blocking antibodies, without a 

20 substantial affect on binding of the neutralising antibodies can be incorporated into the 
improved protein. This can be a reiterative process. 

In addition, the 3D NMR structure will enable the skilled person to carry out preliminary 
computer modelling studies of MSP- 1 19 variants with specific modifications so that, for 
25 example variants that cannot fold properly may be discarded. This will assist in 
minimising the number of candidate MSP- 1 19 variants that need be tested. 

Thus the present invention also provides a computer readable medium having stored 
thereon a model of the MSP- 1 19 NMR structure. In a preferred embodiment, said model is 
30 built from all or some of the NMR data shown in Tables A and B. 

Variants of the present invention may optionally include additional MSP-1 sequences, in 
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particular regions of the MSP- 1 33 region of MSP- 1 42 to confer additional immunogenic ity 
to the variant. Furthermore, additional sequences known to contain and promote T cell 
responses are advantageously included (i.e. T cell epitopes). Other modifications may 
also be made that increase immunogenicity such as modifications that alter the pathway of 
5 antigen processing and presentation. 

Polypeptide variants of the invention are typically made by recombinant means, for 
example as described below. However they may also be made by synthetic means using 
techniques well known to skilled persons such as solid phase synthesis. Proteins of the 

10 invention may also be produced as fusion proteins, for example to aid in extraction and 
purification. Examples of fusion protein partners include glutathione-S-transferase 
(GST), 6xHis, GAL4 (DNA binding and/or transcriptional activation domains) and 
P-galactosidase. It may also be convenient to include a proteolytic cleavage site between 
the fusion protein partner and the protein sequence of interest to allow removal of fusion 

15 protein sequences. Preferably the fusion protein will not hinder the immunogenicity of the 
MSP-1 variant. 

Polypeptides of the invention may be in a substantially isolated form. It will be 
understood that the polypeptide may be mixed with carriers or diluents which will not 
interfere with the intended purpose of the polypeptide and still be regarded as substantially 
isolated. A polypeptide of the invention may also be in a substantially purified form, in 
which case it will generally comprise the polypeptide in a preparation in which more than 
90%, e.g. 95%, 98% or 99% of the polypeptide in the preparation is a polypeptide of the 
invention. 

B. Polynucleotides and vectors 

As discussed above, the variants of the present invention may be produced recombinantly 
using standard techniques. Thus, the present invention also provides a polynucleotide 
30 encoding a polypeptide MSP-1 variant of the invention. Polynucleotides of the invention 
may comprise DNA or RNA. They may also be polynucleotides which include within 
them synthetic or modified nucleotides. A number of different types of modification to 
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oligonucleotides are known in the art. These include methylphosphonate and 
phosphorothioate backbones, addition of acridine or polylysine chains at the 3' and/or 5* 
ends of the molecule. For the purposes of the present invention, it is to be understood that 
the polynucleotides described herein may be modified by any method available in the art. 
5 Such modifications may be carried out in order to enhance the in vivo activity or life span 
of polynucleotides of the invention. It will be understood by a skilled person that 
numerous different polynucleotides can encode the same polypeptide as a result of the 
degeneracy of the genetic code. 

10 Polynucleotides of the invention comprise can be incorporated into a recombinant 
replicable vector. The vector may be used to replicate the nucleic acid in a compatible 
host cell. Thus in a further embodiment, the invention provides a method of making 
polynucleotides of the invention by introducing a polynucleotide of the invention into a 
replicable vector, introducing the vector into a compatible host cell, and growing the host 

15 cell under conditions which bring about replication of the vector. The vector may be 
recovered from the host cell. Suitable host cells include bacteria such as E. coli, yeast, 
mammalian cell lines and other eukaryotic cell lines, for example insect Sf9 cells. The 
host cell may be a methylotrophic yeast such as Pichia pastoris, 

20 The coding sequence of natural or variant MSP polypeptides (including the polypeptide of 
the invention) may be modified for optimal expression in a host cell. For example, 
secondary modification such as N-glycosylation may be prevented by removal of 
sequences necessary for such modification. The sequence of the polypeptide may 
alternatively or in addition be modified with respect to codon usage for optimal expression 

25 in the host cell. Methods of mutagenising a sequence are known in the art; alternatively, 
the modified coding sequence may be generated by means of PCR gene assembly using 
overlapping synthetic oligonucleotides (Stemmer et al., 1995; Withers-Martinez et al., 
1999). 

30 Preferably, a polynucleotide of the invention in a vector is operably linked to a regulatory 
sequence that is capable of providing for the expression of the coding sequence by the 
host cell, i.e. the vector is an expression vector. The term "operably linked" refers to a 
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juxtaposition wherein the components described are in a relationship permitting them to 
function in their intended manner. A regulatory sequence "operably linked" to a coding 
sequence is ligated in such a way that expression of the coding sequence is achieved under 
condition compatible with the control sequences. 

5 

Such vectors may be transformed or transfected into a suitable host cell using standard 
techniques above to provide for expression of a polypeptide of the invention. This 
process may comprise culturing a host cell transformed with an expression vector as 
described above under conditions to provide for expression by the vector of a coding 
10 sequence encoding the polypeptides, and optionally recovering the expressed 
polypeptides. 

The vectors may be for example, plasmid or virus vectors provided with an origin of 
replication, optionally a promoter for the expression of the said polynucleotide and 
15 optionally a regulator of the promoter. The vectors may contain one or more selectable 
marker genes, for example an ampicillin resistance gene in the case of a bacterial plasmid 
or a neomycin resistance gene for a mammalian vector. Vectors may be used in vitro, for 
example for the production of RNA or used to transfect or transform a host cell. The 
vector may also be adapted to be used in vivo, for example in a method of gene therapy. 

20 

Promoters/enhancers and other expression regulation signals may be selected to be 
compatible with the host cell for which the expression vector is designed. For example, 
prokaryotic promoters may be used, in particular those suitable for use in E. coli strains 
(such as E. coli HB 1 0 1 or DH5a). 

25 

When expression of the polypeptides of the invention in carried out in mammalian cells, 
either in vitro or in vivo, mammalian promoters may be used. Tissue-specific promoters 
may also be used. Viral promoters may also be used, for example the Moloney murine 
leukaemia virus long terminal repeat (MMLV LTR), the promoter rous sarcoma virus 
30 (RSV) LTR promoter, the SV40 promoter, the human cytomegalovirus (CMV) IE 
promoter, herpes simplex virus promoters or adenovirus promoters. All these promoters 
are readily available in the art. 
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C. Administration 

The variant MSP-1 polypeptides of the present invention and nucleic acid molecules may 
5 be used to treat or prevent malaria in animals, specifically humans. 

The polypeptides of the invention may be administered by direct injection. Preferably the 
polypeptides are combined with a pharmaceutical ly acceptable carrier or diluent to 
produce a pharmaceutical composition. Suitable carriers and diluents include isotonic 

10 saline solutions, for example phosphate-buffered saline. The composition may be 
formulated for parenteral, intramuscular, intravenous, subcutaneous, intraocular or 
transdermal administration. Typically, each polypeptide is administered at a dose of from 
0.01 to 30 |ig/kg body weight, preferably from 0.1 to 10 jig/kg, more preferably from OA 
to 1 p.g/kg body weight. It is also possible to use antibodies prepared using the 

15 polypeptides of the invention, as described below, in treating or preventing Plasmodium 
infection. Neutralising antibodies, or fragments thereof which retain specificity for 
Plasmodium antigens, can be administered in a similar manner to the polypeptides of the 
invention. 

20 The polynucleotides of the invention may be administered directly as a naked nucleic acid 
construct. When the expression cassette is administered as a naked nucleic acid, the 
amount of nucleic acid administered is typically in the range of from 1 |ig to 10 mg, 
preferably from 100 \ig to 1 mg. 

25 Uptake of naked nucleic acid constructs by mammalian cells is enhanced by several 
known transfection techniques for example those including the use of transfection agents. 
Example of these agents include cationic agents (for example calcium phosphate and 
DEAE-dextran) and lipofectants (for example lipofectam™ and transfectam™). 
Typically, nucleic acid constructs are mixed with the transfection agent to produce a 

30 composition. 
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Alternatively, the polynucleotide may be administered as part of a nucleic acid vector, 
including a plasmid vector or viral vector, such as a vaccinia virus vector. When the 
polynucleotide of the invention is delivered to cells by a viral vector of the invention, the 
amount of virus administered is in the range of from 10 3 to 10 10 pfu, preferably from 10 5 
5 to 10 8 pfu, more preferably from 10 6 to 10 7 pfu. When injected, typically 1-10 jjl! of virus 
in a pharmaceutical^ acceptable suitable carrier or diluent is administered. 

Preferably the delivery vehicle (i.e. naked nucleic acid construct or viral vector comprising 
the polynucleotide for example) is combined with a pharmaceutical ly acceptable carrier or 
10 diluent to produce a pharmaceutical composition. Suitable carriers and diluents include 
isotonic saline solutions, for example phosphate-buffered saline. The composition may be 
formulated for parenteral, intramuscular, intravenous, subcutaneous, intraocular or 
transdermal administration. 

1 5 The routes of administration and dosages described are intended only as a guide since a 
skilled practitioner will be able to determine readily the optimum route of administration 
and dosage for any particular patient and condition. 



20 



D. Preparation of Vaccines 



Vaccines may be prepared from one or more polypeptides of the invention. They may 
also include one or more immunogenic Plasmodium polypeptides known in the art. Thus 
a vaccine of the invention may comprise one or more polypeptides of the invention and 
optionally, one or more polypeptides selected from, for example, the asexual blood stage 

25 proteins: apical merozoite antigen- 1, erythrocyte binding antigen 175, erythrocyte 
membrane protein- 1; the hepatic stage proteins: liver stage antigens 1 and 3; the 
sporozoite stage proteins: circumsporozoite protein , thrombospondin related adhesive 
protein; and the sexual stage proteins Pfs25 and Pfs28 polypeptides and immunogenic 
fragments thereof Preferably, the other immunogenic Plasmodium polypeptides known 

30 in the art do not contain wild type MSP- 1 !9 sequences. 

The preparation of vaccines which contain an immunogenic polypeptide(s) as active 
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ingredient(s), is known to one skilled in the art. Typically, such vaccines are prepared as 
injectables, either as liquid solutions or suspensions; solid forms suitable for solution in, 
or suspension in, liquid prior to injection may also be prepared. The preparation may also 
be emulsified, or the protein encapsulated in liposomes. The active immunogenic 
5 ingredients are often mixed with excipients which are pharmaceutical ly acceptable and 
compatible with the active ingredient. Suitable excipients are, for example, water, saline, 
dextrose, glycerol, ethanol, or the like and combinations thereof. In addition, if desired, 
the vaccine may contain minor amounts of auxiliary substances such as wetting or 
emulsifying agents, pH buffering agents, and/or adjuvants which enhance the 

10 effectiveness of the vaccine. Examples of adjuvants which may be effective include but 
are not limited to: aluminum hydroxide, N-acetyl-muramyl-L-threonyl-D-isoglutamine 
(thr-MDP), N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as 
nor-MDP), N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(r-2 , -dipalmitoyl-sn- 
glycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 1983 5 A, referred to as MTP-PE), 

15 and RIBI, which contains three components extracted from bacteria, monophosphoryl 
lipid A, trehalose dimycolate and cell wall skeleton (MPL+TDM+CWS) in a 2% 
squalene/Tween 80 emulsion. The effectiveness of an adjuvant may be determined by 
measuring the amount of antibodies directed against an immunogenic polypeptide 
containing an MSP-1 antigenic sequence resulting from administration of this polypeptide 

20 in vaccines which are also comprised of the various adjuvants. 

The vaccines are conventionally administered parenterally, by injection, for example, 
either subcutaneously or intramuscularly. Additional formulations which are suitable for 
other modes of administration include suppositories and, in some cases, oral formulations. 

25 For suppositories, traditional binders and carriers may include, for example, polyalkylene 
glycols or triglycerides; such suppositories may be formed from mixtures containing the 
active ingredient in the range of 0.5% to 10%, preferably 1% to 2%. Oral formulations 
include such normally employed excipients as, for example, pharmaceutical grades of 
mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium 

30 carbonate, and the like. These compositions take the form of solutions, suspensions, 
tablets, pills, capsules, sustained release formulations or powders and contain 10% to 95% 
of active ingredient, preferably 25% to 70%. Where the vaccine composition is 
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lyophilised, the lyophilised material may be reconstituted prior to administration, e.g. as a 
suspension. Reconstitution is preferably effected in buffer. 

Capsules, tablets and pills for oral administration to a patient may be provided with an 
5 enteric coating comprising, for example, Eudragit "S", Eudragit M L M , cellulose acetate, 
cellulose acetate phthalate or hydroxypropylmethyl cellulose. 

The polypeptides of the invention may be formulated into the vaccine as neutral or salt 
forms. Pharmaceutical ly acceptable salts include the acid addition salts (formed with free 

10 amino groups of the peptide) and which are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such organic acids such as acetic, oxalic, 
tartaric and maleic. Salts formed with the free carboxyl groups may also be derived from 
inorganic bases such as, for example, sodium, potassium, ammonium, calcium, or ferric 
hydroxides, and such organic bases as isopropylamine, trimethylamine, 2-ethylamino 

15 ethanol, histidine and procaine. 

E. Dosage and Administration of Vaccines 

The vaccines are administered in a manner compatible with the dosage formulation, and in 
20 such amount as will be prophylactically and/or therapeutically effective. The quantity to 
be administered, which is generally in the range of 5 |ig to 250 fig of antigen per dose, 
depends on the subject to be treated, capacity of the subject's immune system to synthesise 
antibodies, and the degree of protection desired. Precise amounts of active ingredient 
required to be administered may depend on the judgement of the practitioner and may be 
25 peculiar to each subject. 

The vaccine may be given in a single dose schedule, or preferably in a multiple dose 
schedule. A multiple dose schedule is one in which a primary course of vaccination may 
be with 1-10 separate doses, followed by other doses given at subsequent time intervals 
30 required to maintain and or reinforce the immune response, for example, at 1 to 4 months 
for a second dose, and if needed, a subsequent dose(s) after several months. The dosage 
regimen will also, at least in part, be determined by the need of the individual and be 
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In addition, the vaccine containing the immunogenic MSP-1 antigen(s) may be 
administered in conjunction with other immunoregulatory agents, for example, 
5 immunoglobulins. 

F. Preparation of antibodies against the polypeptides of the invention 

The variant MSP-1 polypeptides prepared as described above can be used to produce 
10 antibodies, both polyclonal and monoclonal. If polyclonal antibodies are desired, a 
selected mammal (e.g., mouse, rabbit, goat, horse, etc.) is immunised with an 
immunogenic polypeptide bearing an MSP-1 epitope(s). Serum from the immunised 
animal is collected and treated according to known procedures. If serum containing 
polyclonal antibodies to an MSP-1 epitope contains antibodies to other antigens, the 
15 polyclonal antibodies can be purified by immunoaffinity chromatography. Techniques for 
producing and processing polyclonal antisera are known in the art. 

Monoclonal antibodies directed against MSP-1 epitopes in the polypeptides of the 
invention can also be readily produced by one skilled in the art. The general methodology 
20 for making monoclonal antibodies by hybridomas is well known. Immortal antibody- 
producing cell lines can be created by cell fusion, and also by other techniques such as 
direct transformation of B lymphocytes with oncogenic DNA, or transfection with 
Epstein-Barr virus. Panels of monoclonal antibodies produced against MSP-1 epitopes 
can be screened for various properties; i.e., for isotype and epitope affinity. 

25 

The polypeptides of the invention can also be used to select for human monoclonal 
antibodies using the variable regions of immunoglobulin heavy and light chains cloned in 
the fonn of a phage display library, preferably from individuals who have been previously 
exposed to a natural malaria infection. 

30 

Antibodies, both monoclonal and polyclonal, which are directed against MSP-1 epitopes 
are particularly useful in diagnosis, and those which are neutralising are useful in passive 
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immunotherapy. Monoclonal antibodies, in particular, may be used to raise anti-idiotype 
antibodies. Anti-idiotype antibodies are immunoglobulins which carry an "internal 
image" of the antigen of the infectious agent against which protection is desired. 

5 Techniques for raising anti-idiotype antibodies are known in the art. These anti-idiotype 
antibodies may also be useful for treatment of Plasmodium infections, as well as for an 
elucidation of the immunogenic regions of MSP-1 antigens. It is also possible to use 
fragments of the antibodies described above, for example, F(ab') 23 Fab, Facb and scFv 
fragments. 

to 

It should be appeciated that features from various sections, aspects and embodiments of 
the invention as described above are generally equally applicable to other sections, aspects 
and embodiments mutatis mutandis. 

15 The invention will now be further described by way of Examples, which are meant to 
serve to assist one of ordinary skill in the art in carrying out the invention and are not 
intended in any way to limit the scope of the invention. The Examples refer to the 
Figures. In the Figures: 

20 Detailed Description of the Figures 

Figure 1 - MSP-1 sequences aligned according to the EGF-like motif consensus. Top 
sequence: P. falciparum (SWISS-PROT MSP1 PLAFW). Second sequence: P. vivax 
Belem strain (PIR A45604). Third sequence: human EGF (PDB legf). Fourth sequence: 
25 EGF-like domain consensus (Prosite EGF1). Bottom sequence: 14 residue EGF core 
region used for structure alignment in Figure 6. Black highlighting indicates conserved 
residues of the EGF-like domain. Dark shading shows hydrophobic residues at the EGF- 
module pair interface in the P. falciparum, and corresponding conserved residues in the 
P. vivax sequence. 

30 

Figure 2 - Sample of multidimensional heteronuclear NOESY experiments showing 
planes containing NOE connections to the MSP-1 C-terminal fragment Lys35 NH proton. 
Top: l3 C (D4) and 'H(D3) plane from the 4D-[ 13 C]-HMQC-NOESY-[ l5 N]-HSQC 
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experiment, taken at the chemical shift values of Lys35 NH in l5 N(D2) and 'H(Dl). 
Bottom: strip from the 3D [ l5 N]-NOESY-HSQC at the ! H chemical shift value of Lys35 
NH (vertical axis, Dl) taken at the plane of its ,5 N (D3) value. The horizontal ! H axis is 
aligned with that of the top spectrum. The weak cross-peaks at 2.72 and 3.01 ppm in the 
5 3D spectrum do not show corresponding cross-peaks in the 4D spectrum because of the 
lower signal-to-noise ratio in the latter. These peaks have been assigned as the cross-peaks 
between Lys35 NH and Asn44 H p2 (2.72 ppm), and Cys30 H p3 and/or Cys41 H p2 (3.01 
ppm). 

10 Figure 3 - Stereo drawing showing the backbone C, N, C a atoms of the 32 refined 
structures in the final ensemble. The domain- 1 is on the left (red), with domain-2 on the 
right (green), and both the N- and C-termini are near the bottom. 

Figure 4 - MOLSCRIPT picture of the most representative model of the ensemble, 
1 5 showing the backbone C a trace, antiparallel P-sheet elements, and disulphide bridges (S Y 
atoms in yellow). Domain- 1, red; Domain-2, green. 

Figure 5 - Alignment of typical EGF-like family members with the fitpdb program, using 
the 14 amino acid "reduced core" consensus (Bersch et aL 9 1998) (see Figure 1). The 

20 aligned backbone segment in each structure is white. The structures are aligned relative to 
the most representative structure of the group (factor Xa), with increasing divergence from 
left to right. Numbers indicate the rmsd value of the aligned C, N, C a atoms. PDB 
identification codes: factor Xa (crystal structure), lhcg; Complement Clr component, 
lapq (14 th model); human EGF, legf (1 1 th model); fibrillin-1, domains-32 and -33^1emn 

25 (minimized average structure); transforming growth factor-ot, 2tgf (minimized average 
structure); MSP-1 domains- 1 and -2, this study. 

Figure 6 - Backbone ribbon view of fibrillin- 1 versus MSP-1 EGF module pair 
arrangements. Fibrillin-1 (lemn) cyan (domain-32) and magenta (domain-33) (Downing 
30 et al. 9 1996); MSP-1 domain- 1 (yellow) and domain-2 (green). Structures were aligned as 
in Figure 6 by the core consensus of the N-terminal domain of each pair. The bound Ca 2+ 
ions in the fibrillin- 1 structure are shown as magenta spheres. 
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Figure 7 - Two views, a and b, (rotated 180° about the y-axis) of the electrostatic potential 
surface of the MSP-1 EGF module pair, calculated with GRASP. Red indicates negative 
charge, blue indicates positive charge, and white is neutral. The orientation of the views 
5 is shown by the adjacent worm diagrams. 

Figure 8 - CPK model of the MSP-1 C-terminal fragment, showing the location of some 
mutations that affect binding of monoclonal antibodies. Domain- 1 is towards the top and 
right sides, and domain-2 towards the bottom left. 

10 

Figure 9 - Examples of the binding of monoclonal antibodies to GST-MSP-1 19 detected by 
Western blotting. The binding of each monoclonal antibody to protein based on the wild 
type sequence and to proteins containing modified sequences is shown. The monoclonal 
antibodies are shown across the top. On the left is shown the proteins: WT, wild type 
15 sequence; 22, Leu22 to Arg; 26, Glu26 to He; 15, Asnl5 to Arg; 27, Glu27 to Tyr; 31, 
Leu31 to Arg; 43, Glu43 to Leu; 27+31+43, Glu27 to Tyr and Leu31 to Arg and Glu43 to 
Leu; 1 5+27+3 1+43, Asnl5 to Arg and Glu27 to Tyr and Leu3 1 to Arg and Glu43 to Leu. 

Figure 10 - The binding of monoclonal antibodies to GST-MSP-1 19 detected by BIAcore 
20 analysis. The binding of each monoclonal antibody is normalised to 100% binding to 
protein based on the wild type sequence and the binding of proteins containing modified 
sequences is expressed as a percentage of this. WT, wild type sequence; 15, Asn 15— ►Arg; 
26, Glu26— lie; 27, Glu27-^Tyr; 31, Leu31^Arg; 34, Tyr34-+Ser; 43 Glu43-+Leu. 

25 Figure 11 - The binding of monoclonal antibodies to GST-MSP-1 19 containing multiple 
modifications detected by BIAcore analysis. The binding of each monoclonal antibody is 
normalised to 100% binding to protein based on the wild type sequence and the binding of 
proteins containing modified sequences is expressed as a percentage of this. WT, wild 
type sequence; The combinations contain 3 mutations [27+31+43], or 4 mutations 

30 ([27+31+34+43] and [15+27+31+43]), at each site the changes are those identified in 
Figure 10. 
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Figure 12 - Identification of blocking antibodies using a competitive binding assay and 
immobilised wild type GST-MSP-1 19. The ability of antibodies to compete with the 
binding of mAbs 12.8 and 12.10 to GST-MSP-1 19 was measured using BIAcore analysis. 
Individual antibodies (x-axis) were bound to the antigen and then the amount of either 
5 12.8 or 12.10 (inhibitory mAb) that could subsequently bind was quantified. The amount 
of binding is presented as a percentage of the total amount of either 12.8 or 12.10 bound in 
the absence of pre-incubation with another antibody. 

Figure 13 - Antibodies induced by immunisation with a modified recombinant MSP- 1 19 
10 assayed for their ability to inhibit secondary processing. Washed 3D7 rnerozoites were 
either analysed directly without incubation (0 h) or incubated for 1 hour at 37°C in the 
presence of no serum (no serum), 1 mM PMSF as a control for complete inhibition, 
normal rabbit sera (normal serum), or serum from a rabbit immunised with the 
15+27+31+43 modified protein (immune serum), all at 1:10 dilution in reaction buffer. 
15 The level of MSP- 1 33 released into the supernatant as a results of secondary processing 
was measured using an ELISA method and is represented by Absorbance at 492nm. 

Figure 14. Pichia pasioris codon preference table used for input to the CODOP program. 

20 Figure 15. DNA and protein sequences for the optimized synthetic MSP-142 gene. A: 
Complete sequence designed for optimum codon usage and expression in P. pastor is. B: 
Sequence of the synthetic MSP-119 construct in the expression vector pPIC9K-HXa. 
Uppercase letters: vector sequences, including the His6 tag and factor Xa cleavage site 
(IEGR). Lowercase letters: synthetic MSP-1 19 coding sequence. The cloned sequence in 

25 located at the SnaBI restriction site of the pPIC9K sequence. C: Expressed protein 
sequence of the synthetic MSP-119 construct. The sequence shown is produced as a 
fusion to the pPIC9K a- factor secretion signal, following the kex2/STE13 processing 
sites. The synthetic MSP-1 19 is in bold-face type. D: Sequence of the MSP-133 construct. 
The cloned sequence is located at the Smal site of the pUC118 vector. E: Predicted 

30 protein sequence of the synthetic MSP-133 construct translation product. 

Figure 16. Gene assembly PCR reactions for the MSP-133 and MSP-119 sequences. 
Reaction 1: 10 jiL aliquots of the assembly reactions. Reaction 2: 20 jxL aliquots of the 
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amplification reactions. The N-terminal and middle fragments were subsequently spliced 
together to form the MSP- 133 synthetic construct. The C-terminal fragment synthesis 
reactions produced the optimized MSP-1 19 construct. 

5 Figure 17. Expression of the synthetic MSP-1 19 protein in P. pastoris. Lanes 1-6: 
trichloroacetic acid precipitates of secreted recombinant protein from culture supernatants, 
without further purification (5 each). Samples from duplicate cultures of three 
independent transformants. Lane 8,9: purified, deglycosylated MSP-1 19 produced from 
the original P. falciparum sequence. Lane 7,10: NOVEX molecular weight markers. 

10 

Figure 18. A: { ! H/ 15 N}-HSQC spectrum of the protein (2.5 mM) expressed from the 
optimized synthetic MSP-1 19 gene. B: Control {'H/^NJ-HSQC of deglycosylated protein 
(2.2 mM) expressed from the original P. falciparum sequence (Morgan et al., 1999). 

15 Examples 

Materials and Methods 

Protein expression and stable-isotope labelling for NMR 

20 

The coding sequence of the MSP-1 C-terminal fragment was cloned by polymerase chain 
reaction with Vent polymerase (New England Biolabs) from a plasmid containing the 
Plasmodium falciparum strain T9/94 fragment (Blackman et al, 1991), using primers that 
included codons for a 6 residue N-terminal His tag (CACCATCATCATCATCAC), and 

25 inserted into the SndRl restriction site of the pPIC9K vector (Invitrogen). The sequence 
corresponds to residues 1526-1621 of the SWISS-PROT entry MSP1 PLAFW (accession 
number P04933). This produced an a-factor fusion protein with the sequence 
...KR/EA/EAA r HHHHHHNISQ....SSSN, where the slashes indicate kex2 and STE13 
processing sites. High copy number transformants of the methylotrophic yeast 

30 Komagataella (Pichia) pastoris protease-deficient strain SMD1168 (his4 pep4) were 
isolated by screening for high G41 8 resistance (Clare et al., 1995). 

A Mut + transformant was grown at 29.4 °C in a shaker-incubator in buffered minimal 
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medium (100 mM potassium phosphate, pH 6.0, yeast nitrogen base (0.34 % w/vol) 
(DIFCO: YNB without amino acids and without (NH 4 ) 2 S0 4 ), biotin (4xl0" 5 % w/vol), 
Sigma antifoam 289 (0.01% vol/ vol), and carbon and nitrogen sources as described below. 
Unlabelled samples were initially grown in medium containing 1 % w/vol (NH^SO* and 

5 1 % w/vol glycerol, and induced by transfer to medium containing 0.5 % CH3OH as the 
carbon source. Labelled samples were initially grown in medium containing 0.2 % w/vol 
[ I5 N]-(NH 4 )2S0 4 (Isotech), and 0.5 % w/vol glucose or [ l3 C 6 ]-glucose (Isotech), and 
induced by transfer to medium containing as carbon source 0.5 % w/vol CH3OH or [ ,3 C]- 
CH3OH (Isotech). The initial cultures were grown in 150 ml to a density of -10 OD 6 oo, 

10 then harvested and resuspended in methanol medium at 1 OD600 in a volume of 1.5 L. 
Methanol-induced cultures were grown for 4 d, with daily addition of 7.5 ml CH3OH or 
[ l3 C]-CH 3 OH, to a final density of -18 OD6<x>. This protocol produced a maximum yield 
of 24 mg/L of purified, l3 C/ 15 N uniformly labelled protein at the final stage (see below). 
The YNB-based medium produced about 3-fold higher yields than the FM22 medium 

15 (Laroche et aL, 1994), for stable-isotope labeling of the MSP-1 C-terminal fragment. 



Cells were removed by low-speed centrifugation, protease inhibitors added 
(COMPLETE™ tablets, Boehringer-Mannheim; 1 tablet/500 ml supernatant), and the 
supernatant was filter-sterilized. The supernatant was concentrated -20-fold by 

20 ultrafiltration in a stirred cell (Amicon, YM3 membrane) at 4 °C. The pH was adjusted to 
7.25 with KOH, and the partially N-glycosylated MSP-1 fragment was deglycosylated for 
72 h at 37 °C with 5000 U PNGaseF (New England Biolabs). The carbohydrate was 
completely removed (as shown by electrophoresis and mass spectrometry), with the Asn 1 
residue presumably converted to Asp in the process. The supernatant was clarified by low- 

25 speed centrifugation, 5 M NaCl was added to a final concentration of 0.3 M, and the 
sample was applied to a 2 ml Ni-NTA affinity column (QIAGEN), washed, and eluted 
with 250 mM imidazole according to manufacturer's instructions. The eluate was 
dialyzed against 50 mM sodium phosphate (pH 6.5), 50 mM NaCl, and then passed 
through a 1 ml Hi-Trap Q anion exchange resin (Pharmacia) to remove misfolded MSP-1 

30 that bound to the column. The MSP-1 fragment was characterized by Western blotting and 
electrospray mass spectrometry (data not shown). Two principal species of mass 1 1607 
and 11807 Da were observed corresponding to the expected fragment, as well as a 
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fragment with an additional N-terminal Glu - Ala dipeptide resulting from incomplete 
STE1 3 processing of the a- factor secretion signal. 

Samples for NMR experiments were prepared in either 90 % H 2 O/10 % D 2 0 with 0.01 % 
5 w/vol NaN 3 , or 100 % D 2 0, 50 mM sodium phosphate, 100 mM NaCl at pH 6.5, (pH 
uncorrected for deuterium isotope effects), at a concentration of 2.1 to 2.6 mM in 0.6 ml. 
Protein concentration was measured by UV absorbance at 280 nm, using a calculated 
molar extinction coefficient of 5220 liter mol" 1 cm" 1 . The protein was demonstrated to be 
monomeric by equilibrium ultracentrifiigation of a 0.12 mM sample in the above buffer at 
10 293 K. 

NMR experiments and data processing 

Most of the experiments were performed at 298 K, using Varian Unity and Unity-Plus 
15 spectrometers operating at 600 MHz and 500 MHz respectively. Details of the 
multidimensional experiments (Clore & Gronenborn, 1998) and acquisition parameters 
used for resonance assignments and structure determination are given in Tables A/B and 
have been submitted to the Protein Data Bank database (PDB Accession No 1CEJ). 

20 All spectra were processed using Felix 95.0 or 97.0 (Biosym/MSI) using a 90 degree- or 
72 degree-shifted sinebell-squared window function. Dimensions, zero-filling, and linear 
prediction details are summarized in Tables A/B and in the submission to the 
BioMagResBank. Four dimensional and interleaved spectra were processed in Felix using 
macros written in-house. 

25 

Signal assignments: Sequential assignments were made based on connectivities 
established primarily by CBCA(CO)NH and CBCANH experiments on uniformly l3 C/ l5 N 
labelled protein. Side chain-spin system assignments were made on the basis of data from 
13 C/ ! H -HCCH-TOCSY experiment correlated with information from ^N/'H-TOCSY- 
30 HSQC and ^N/'H-NOESY-HSQC, and HNHA and HNHB experiments. Assignments 
were obtained for 'H, i5 N and aliphatic ,3 C signals for 98% of side-chains and 96% of 
backbone amide groups. The list of assignments is given in Tables A/B and in the 
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submission to the Protein Data Bank database (PDB Accession No 1CEJ). The l5 N{ l H} 
heteronuclear NOE experiment was carried out as described previously (Kay et al. t 1989; 
Polshakove/a/., 1997). 

5 Distance Restraints: NOE- and ROE-derived distance restraints between backbone and 
side chain amide protons were obtained primarily from the 3D ,5 N-NOESY-HSQC, 
l5 N-ROESY-HSQC, and 4D !3 C-HMQC-NOESY- 15 N-HSQC experiments. Aliphatic to 
aliphatic proton distance restraints were obtained from a 4D 13 C-HMQC-NOESY- 13 C- 
HSQC experiment. A 3D 13 C-HMQC-NOESY experiment in D 2 0 was used to identify 

10 aliphatic to aromatic proton NOEs and 2D NOESY experiments were used to measure 
aromatic to aromatic proton NOEs. Crosspeaks were quantified by volume integration in 
Felix for 2D and 3D experiments and for the 4D ,3 C-HMQC-NOESY- l5 N-HSQC 
experiment, and from peak height measurements in the 4D l3 C-HMQC-NOESY- l3 C- 
HMQC spectra. Crosspeaks were classified as strong, medium and weak and these were 

15 assigned to distance restraints of 0 - 2.8, 0 - 3.6, and 0 - 5.5 A. Restraints from backbone 
amide signals were initially treated in this manner, and then recalibrated more precisely 
using 3D- 1:) N-ROESY-HSQC data into four classes involving maximum distances of 2.6, 
3.1, 3.6, and 4.1 A. Restraints to groups of equivalent or non-stereoassigned protons were 
treated by r" 6 summation. Most intraresidue distances (HN-Hp and H a -Hp) were converted 

20 to Xi angle restraints as described below and these distance restraints were not included in 
the final list. 

Dihedral Angle Restraints: %\ angles and stereospecific assignments of p-methylene 
protons were obtained using the grid-search program AngleSearch, with coupling constant 

25 and intraresidue ROE distance information (Polshakov et aL 9 1995). The coupling 
constant information was provided by HNHB and HN(CO)HB spectral intensities for 
3 J(HN-Hp) and 3 J(CO-Hp), and intraresidue distances (HN-Hp, H a -Hp) were obtained from 
3D ,5 N-ROESY-HSQC and 2D ROESY (D 2 0) experiments. 3 J(HN-H a ) coupling 
constants were obtained from the HNHA experiment. Residues with positive <() angles (ca. 

30 +60 degrees) were identified by large intraresidue H a crosspeak intensities in the 
HN(CO)HB experiment, and y angles near -60° degrees from strong H a( M) crosspeaks in 
the HNHB experiment. He and Leu X2 angles and Leu 5 stereoassignments were derived 
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from the LRCH experiment. Minimum ranges of 40 degrees (xi, X2) and 50 degrees (<(>, vj/) 
were used to account for errors and local dynamic effects on the coupling constants. 

Disulphide Bonding Pattern: An initial set of 20 structures was calculated by simulated 

5 annealing using approximately 550 unambiguous NOE-derived distance restraints and 36 
Xi and <|> dihedral angle restraints but with no hydrogen bonding or disulphide bond 
constraints. The Cys - Cys S Y distances in these structures were examined in order to 
establish the probable bonding pattern. Prior to the calculations, the formation of 
disulphide bridges for 4 Cys residues (Cys 12 - Cys28, Cys78 - Cys92) was already 

10 established with high probability by the observation of Hp-Hp NOEs between these pairs 
of Cys residues. Examination of the initial structures confirmed these disulphide bridges 
and also indicated a disulphide bridge between residues Cys30 and Cys41. The third 
disulphide bridge in domain- 1 (Cys7 - Cys 18) could thus be assigned by default, although 
the structure of the N-terminus was not well-defined by the NMR data. The best six 

15 structures in terms of total X-PLOR energy and violations indicated that the average Cys - 
Cys S Y distance was lowest for the disulphide bonding pattern [1-3, 2-4, 5-6] in each 
domain, and only this combination allowed all Cys residues to form contacts with a 
partner <3.5 A away. Thus, this disulphide bonding pattern was most consistent with the 
experimental data for both domains, and was imposed (initially as NOE-style distance 

20 restraints) in subsequent calculations. The [1-3, 2-4, 5-6] pattern is that expected for an 
EGF-like domain. 

Hydrogen Bonds: Non-exchanging amide groups involved in stable hydrogen bonds were 
identified in spectra of samples examined in 100 % D 2 0. The corresponding hydrogen 

25 bond acceptors were determined by examining the initial structural ensemble, using the 
Insight II and HBPlus (McDonald et al y 1994) programs, and hydrogen bond distance 
restraints were included in subsequent calculations. Further hydrogen bonds were 
identified in a similar manner in iterative calculations. Only 10 backbone hydrogen bonds 
in the antiparallel P sheets were used as restraints. Two distance restraints were used for 

30 each hydrogen bond, 1.7 - 2.3 A from proton to acceptor, and 3.0 - 3.6 A from donor 
nitrogen atom to acceptor. 
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All the structure calculations were performed following standard protocols for ab initio 
simulated annealing from an extended chain using X-PLOR version 3.843 on a Silicon 

5 Graphics Origin 200 computer. The initial calculations used an initial temperature of 
1000K, and 9000 steps of 5 fs in the restrained molecular dynamics stage. A soft-square 
potential was used for distance restraints. The SHAKE (Ryckaert et al., 1977) algorithm 
was employed during molecular dynamics to maintain correct bond lengths. Refinement 
used a square well potential for restraints, and a final slow cooling of 30000 steps of 4 fs 

10 each from 2000K. A modified "parallhdg.pro" force-field parameter set was used, with 
modifications to parameters for Arg and Pro residues, and for hydrogen bonds (Polshakov 
et al. y 1997). Force constants were 50 kcal mof 1 A" 2 for all distance restraints including 
hydrogen bonds, and 200 kcal mof 1 rad* 2 for dihedral restraints. The N-terminal sequence 
including the vector-encoded residues and (His)6 tag was excluded from the structure 

15 calculations. All peptide bonds were constrained to be trans. NOE data for all 5 Pro 
residues showed strong H a (i-i)-ProH a crosspeaks, consistent with the trans peptide 
conformation. 

Initial structures were calculated as described above to determine the disulphide bonding 
20 pattern. Then the calculation was repeated with identical NOE-derived distance and 
dihedral angle restraints, with the addition of 6 distance restraints (1.92-3.12 A) 
representing the disulphide bridges. A new set of 50 structures was obtained, from which 
the best 20 structures were selected. The criteria used for selection were that the 
structures were below the median value of both total X-PLOR energy and rms NOE 
25 difference, and had no dihedral angle violations. The resulting structures had good 
geometry and between zero and two NOE violations > 0.5 A. These structures were used 
to assign previously ambiguous NOEs and to determine the hydrogen bonds as described 
above. 

30 The final structure calculation and refinement used an expanded restraint list including 
hydrogen bonds, additional dihedral restraints, stereoassignments of P-methylene and Leu 
5 signals, and more precisely calibrated ROE data (see Table 1). A set of 100 structures 
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was obtained using this list, and 38 structures with 0-2 NOE violations > 0.5 A and no 
dihedral angle violations > 5° were accepted. These 38 structures were refined by the 
slow-cooling procedure described above, producing a final ensemble of 32 accepted 
structures with no NOE violations > 0.5 A and no dihedral angle violations > 5°. These 
5 selection criteria produced an ensemble of structures that extend to the end of the 
continuum of total potential energies in order to include structures having large scale 
correlated motions (Abseher et aL 9 1998). Statistics for the final ensemble are given in 
Table 1. Coordinates for the 32 refined structures have been deposited in the Brookhaven 
Protein Data Bank (coordinates ID code lcej; NMR restraints ID code rlcejmr). 

10 

Structures were analyzed during the calculation process using X-PLOR 3.8 (Nilges et al. f 
1991), PROCHECK-NMR/AQUA (Laskowski et al. y 1996), and Insight II for quality of 
agreement with experimental data, precision, geometry, and energy. Models were aligned 
with Insight II and fitpdb, and displayed with Insight II, MOLSCRIPT (Kraulis, 1991), 
15 and GRASP (Nicholls et a/., 1991). 

Table 1. 

A: RESTRAINTS SUMMARY 
20 Number of conformers calculated: 1 00 Number of conformers accepted: 32 
Acceptance criteria: 

No distance violation: > 0.5 A No dihedral angle violation: > 5 ° 

25 NOE/ROE distance restraints: 

Intraresidue: 73 Sequential: 222 

Medium range ( 2-4 ): 90 Long range ( > 4 ): 185 
Total: 570 

30 Dihedral angle restraints: 

phi: 25 psi: 33 chi-1 : 22 chi-2: 5 Total: 85 
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Hydrogen bonds: 10 Disulphide bonds: 6 



B: STRUCTURE QUALITY 

average +•/- s.d. 

Total X-PLOR energy (kcal mol 1 ) J 68 20 

NOE X-PLOR energy (kcal mol"') 2 1 8 



rmsd NOE 0.026 0.005 

rmsd dihedral angle 0.236 0.095 

10 rmsd bond length 0.0029 0.0002 

rmsd bond angle 0.357 0.023 

rmsd improper 0.266 0.018 



Backbone rmsd of structured region: (69 residues) 

15 Overall: 1.05 0.28 

Domain-1: 0.81 0.32 

Domain-2: 0.83 0.35 



Ramachandran plot quality (phi/psi angles): 

20 Most favoured 49.5 % 

Additional allowed 42.1 % 

Generously allowed 5.6 % 

Disallowed 2.7 % 

25 Monoclonal antibodies fmAbs) 



Anti-MSP-l[9 monoclonal antibodies used in this study were : mouse IgG mAbs 1E1, 
1E8, 2F10, 111.2, 111.4 2.2, 5.2, 7.5, 9C8, 12.8, 12.10, 12D11, 117.2, 8A12 (Holder et 
al. t 1985; McBride & Heidrich, 1987; Blackman et aL, 1987; Guevara Patirio et al., 1997); 
30 and mouse IgM mAb 5B1 (Pirson & Perkins, 1985). Of these, mAbs 12.8, 12.10 and 5B1 
are neutralising, inhibitory antibodies and 1E1, 2.2, 7.5, 9C8 and 111.4 are blocking 
antibodies. Some antibodies such as 1 1 1 .2 are neither inhibitory nor blocking. 
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Construction of modified MSP-1 m> clones 

The DNA coding the wild type MSP- 1 19 domain of Plasmodium falciparum (T9- 
5 94/Wellcome strain) MSP-1 has been cloned in expression vector pGEX-3X to produce 
MSP- 1 19 fused to the carboxy-terminus of the Schistosoma japonicum glutathione S- 
transferase (GST) in Escherichia coli (Burghaus & Holder, 1994). Site-directed 
mutagenesis of MSP-1 19 DNA sequence was done in either of two ways. 

10 The first method was a modification of the method of Perrin & Gilliland (1990) to carry 
out polymerase chain reaction (PCR)-mediated site specific mutagenesis. DNA was 
amplified using the plasmid as a template together with one oligonucleotide to introduce 
the point mutation and a 5' primer from outside of the MSP-1 19 sequence. The amplified 
product was purified after electrophoresis on an agarose gel and used in a second 

15 amplification step together with a 3' primer from outside of the other end of the MSP-1 19 
sequence and the plasmid as template. This second PCR product was digested with the 
restriction enzymes EcoRl and BamHl and the product consisting of the modified MSP- 
I.19 coding sequence was inserted back into pGEX-3X and the products were used to 
transform DH5a cells. 

20 

The second method used the QuikChange™ Site-directed mutagenesis kit from 
Stratagene. Briefly, using the plasmid pGEX-MSP-l| 9 as a template, two complementary 
synthetic oligonucleotide primers containing the desired point mutation were designed and 
were extended on the template by temperature cycling with the enzyme Pfu DNA 

25 Polymerase. This incorporation of the oligonucleotide primers results in the generation of 
a mutated plasmid containing staggered nicks in the DNA sequence. Following the 
temperature cycling, the product was treated with Dpn\ endonulease which will digest the 
methylated parental DNA template and leaves the mutation-containing newly synthesised 
DNA intact. The DNA incorporating the desired mutation was then transformed into E. 

30 coli strain DH5ct (Life technologies) competent cells where the nicks will be repaired. 

Clones were screened by analysis of restriction enzyme digests and by PCR screening of 
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the insert gene. The DNA sequence of the selected mutant clones was confirmed using a 
PerkinElmer Applied Biosystems ABI 377 automatic sequencer according to the 
manufacturer's instructions. 

5 Expression of the GST-MSP-1 jq fusion proteins 

Expression of GST-MSP-1 19 was induced with 1 mM isopropyl-p-D- 
thiogalactopyranoside (IPTG; Melford Laboratories) for 1 hour in the E. coli strain TOPP 
1 (Stratagene). The cells were then harvested by centrifiigation and the cell pellet was 

10 resuspended in cell lysis buffer (50 mM Tris-HCl/1 mM EDTA pH 8.0 containing 0.2% 
(v/v) Nonidet P40 (NP40; BDH). Phenylmethylsulphonyl fluoride (PMSF; Sigma) in 
isopropanol was added to a final concentration of 1 mM. The cell suspension was 
sonicated, on ice, using VibraCell sonicator (Sonics & Materials) at 50% duty cycle for 3 
min (six 30 sec pulses with 30 sec in between). The cell lysate was centrifuged at 65000 x 

15 g for 1 hour at 4°C. Supernatant containing soluble GST-fusion protein was applied to a 
glutathione-agrose column (Sigma) and the GST- fusion protein was eluted with 5 mM 
reduced glutathione. The eluted GST-fusion protein was dialysed extensively against 
phosphate buffered saline (PBS) at 4°C. 

20 SDS-PAGE and Western Blotting 

Proteins were analysed by polyacrylamide gel electrophoresis in the presence of sodium 
dodecyl sulphate (SDS-PAGE). Samples were solubilised in SDS-PAGE buffer without 
reducing agents, then fractionated on a homogeneous 12.5% polyacrylamide gel. The pre- 

25 stained low range molecular mass markers (24-102kDa) from Bio-Rad were used as 
markers. When required, SDS-PAGE-fractionated polypeptides were either stained with 
Coomassie Brilliant Blue R-250 (CBB; Sigma) or electrophoretically transferred to 
Optitran BA-S 83 reinforced nitrocellulose (Schleicher & Schull, 0.2 |im pore size) for 
analysis by western blotting. Blots were blocked with 5% BSA, 0.5% Tween 20 in PBS 

30 (PBS-T) for 1 h at room temperature, then washed in PBS-T. Blots were probed with first 
antibodies for 2 h at room temperature, washed 3 times in PBS-T, and then incubated in 
1/1000 dilution of horse radish peroxidase (HRP)-conjugated sheep anti-mouse IgG (H+L) 



WSDOCID: <WO 0063245A3_1A> 



WO 00/063245 PCT/G B00/0 1558 

-39- 

(ICN Immunobiologicals) or Goat anti-mouse IgM (ji chain) (Sigma) for 1 h at room 
temperature. Blots were then washed 3 times in PBS-T and developed using Super Signal 
Substrate (Pierce) as HRP substrate for 1 min. Blots were then placed in plastic wrap and 
exposed to X-ray film (XB-200, X-ograph Imaging Systems). The films were processed 
5 with an Agfa Gevamatic60 film processor (Agfa). 

Analysis of antibody-antigen interaction using a BIAcore machine 

GST-MSP-1 19 containing either the wild type or various modified sequences was used to 
10 coat a carboxymethyl dextran hydrogen sensor chip by the following methodology. The 
binding of the GST-MSP-1 19 was via amino groups using EDC/NHS chemistry. 
Immobilisation was done with the amine coupling kit (Pharmacia BIAcore). The CM 
dextran surface was activated with 50 |nl of 200 mM l-ethyl-3-(3-dimethylaminopropyl) 
carbodiimide (EDC) and 5 mM N-hydroxysuccinamide (NHS) for 10 min. GST-MSP-1 19 
15 was then coupled to the BIAcore sensor surface using 50 (al of a solution at 100 jig mf l in 
coating buffer (0.0 1M sodium acetate buffer, pH 3.5) for 10 min. Unreacted carboxyl 
groups were blocked by adding 50 jil 1 M ethanolamine, pH 8.5 for 10 min. The cells 
were washed with two pulses of 20 nl 10 mM glycine-HCl, pH 2.8 for 8 min in total to 
remove any non-covalently bound protein. The immobilisation procedure was carried out 
20 at a flow rate of 5 \il min" 1 . Measurements were performed on the BIAcore 2000 
instrument. 

RESULTS 

25 Example 1 - Resonance assignments, NMR restraints and structure determination 

The assignments and restraints were obtained as described in Materials and Methods using 
a range of multidimensional heteronuclear experiments with l3 C/ ,5 N uniformly labelled 
protein. Sample spectra from 3D and 4D experiments showing NOE connections to the 
30 Lys35 backbone amide NH proton, resolved and unambiguously assigned using the l3 C 
chemical shift information, are shown in Figure 2. The distance, dihedral angle and 
hydrogen bond restraints used in the final set of structure calculations are summarized in 
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Table 1. A total of 570 unambiguously assigned distance restraints, 85 dihedral angle 
restraints, and 10 hydrogen bonds were used in the final set. The assignments and restraint 
list shown in Table A have been submitted to the BioMagResBank database. Three 
disulphide bonds, with the (1-3, 2-4, 5-6) pattern for each domain were experimentally 

5 determined from the NMR data in preliminary calculations as described in Materials and 
Methods, and these were also included in the final refinement. A final set of 32 models 
was calculated and refined using these restraints and these structures are shown in Figure 
3 superimposed on the backbone of the representative structure S rep . Table 1 shows that 
all 32 models have good geometry and are in good agreement with the experimental data 

10 with no NOE violations > 0.5 A and no dihedral angle violations > 5°. The atomic rmsd 
value for the backbone atoms of the well-structured region (residues 1 5 - 64, 74 - 92) is 
1 .05 A (see Table 1). The local backbone rmsd is highest at the N-terminus (up to Cysl2), 
in the loop Glu65 - Lys73, and following Cys92 at the C-terminus. The Ramachandran 
plot quality is typical of that found for other EGF structures (Doreleijers et al. y 1998). 

15 

Description of the structure 

EGF-domains 

Analysis of the final ensemble by PROCHECK-NMR indicated that each domain contains 
20 a major stretch of antiparallel (3-sheet containing the third and fourth Cys residues of each 
domain, as expected for an EGF-like fold, as well as an additional minor antiparallel 
P -sheet at the C-terminal end of domain- 1, similar to some (but not all) EGF family 
members. These secondary structure features, together with the disulphide bonding 
patterns, can be seen in Figure 4. There is also a well-defined type II tight turn in domain- 
25 1 , with a hydrogen bond from Tyr 34NH proton to Leu3 1 carbonyl oxygen. The normally 
conserved EGF consensus Gly residue in the tight turn is replaced in domain- 1 by a 
residue with a positive <J> angle (Asn33), while the conserved aromatic residue is present 
(Tyr34). There is a probable hydrogen bond between Leu 31NH proton to Asnl5 carbonyl 
oxygen. Domain-2 contains two turns preceding the major p-sheet, (Asn53 - Cys56, 
30 Asp57 - Ala60), and a final bend from Leu86 - Phe91 with a probable hydrogen bond 
from Asp57 NH proton to the carbonyl oxygen of Ile90 or Gly89. A surface-exposed loop 
from Pro81 to Pro85 replaces the tight turn, while the aromatic residue is not conserved. 
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The large loop at the end of the major b-sheet (Glu65 - Lys73) is relatively disordered, and 
high mobility for the segment Gly68 - Gly7l was confirmed by backbone amide l5 N{ l H} 
heteronuclear NOE measurements (Barbato et al, 1992). The heteronuclear NOE values 
are dramatically reduced for residues in this region. At the N-terminus: the low NOE 
5 intensities correspond to increased mobility compared with the rest of the protein. The 
interdomain linker region from Pro45 to Pro47 is distinct from other EGF-like module 
pairs. The conformations of the disulphide bridges between Cys30 - Cys41 in domain- 1, 
and the three Cys - Cys bonds in domain-2 are all left handed spirals (Richardson, 1981). 
Bridges between Cys30 - Cys4l, Cys56 - Cys76, and Cys78 - Cys92 are particularly close 
10 to their equivalents in the blood coagulation factor Xa structure (lhcg). The conformations 
of the first two disulphide bridges in the relatively disordered N-terminal segment of 
domain- 1 were not determined. 

Figure 5 shows the backbone C, N, C a atom alignments of the two MSP-1 C-terminal 
15 fragment domains made with typical examples of EGF-like domains from several 
proteins, using the fitpdb program. Pairwise alignments showed that the two domains 
from MSP-1 are more similar to the factor Xa structure and its close relative from Clr, 
than to each other or to the other structures tested. The rmsd values for MSP-1 domains 
compared to factor Xa are comparable to those of the more distantly related structures 
20 fibrillin- 1 and transforming growth factor-ct. 

The overall fold of each MSP-1 domain is thus similar to typical EGF family members, 
with the turns following the fifth Cys residue roughly equivalent, in spite of the 
divergence from the EGF consensus (C(5)xxGa) where a is a Phe or Tyr -residue. 
25 Although some of the external loops are disordered, the scaffold is quite stable, as 
indicated by the non-exchangeable backbone amides (see above and in Protein Data 
Bank/BioMagResBank submission for details). 

Unlike many EGF-like domains such as fibrillin- 1, the MSP-1 C-terminal fragment lacks 
30 the conserved EGF Ca 2+ -binding sequence and there was no evidence of Ca 2 * binding to 
the MSP-1 C-terminal fragment. The 2D 'H-NOESY spectra were virtually identical in 
the absence or presence of 20 mM CaCl 2 , indicating that any binding that might occur has, 
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Domain interface and surface 

The most striking feature of the MSP-1 C-terminal fragment structure is the interface 
5 between the domains, which consists of several nonpolar amino acids (Phel9, Leu31, 
Leu32, Leu86, Phe87, Ile90 and Phe91) involved in hydrophobic interactions. These 
residues join the base of the major p-sheet and the tight turn in domain-1 with the final 
bend from residue 86 to 91 in domain-2. The domain interactions result in the domains 
forming a U-shaped structure which contrasts with structures observed for other pairs of 

10 EGF domains (Downing et al. 9 1996; Brandstetter et a/., 1995). For example, in fibrillin- 1, 
the interface between EGF domains 32 and 33 is largely formed by a shared Ca 2+ ligation 
site (Downing et al. y 1996), and the overall structure resembles a rigid rod, with distant N- 
and C-termini. This contrasts with MSP-1 where the EGF-like domains are folded against 
each other so that their termini are relatively close together. A comparison of fibrillin- 1 

15 and MSP-1 EGF module pairs is shown in Figure 6. Although both termini of the MSP-1 
C-terminal fragment are somewhat disordered, NOE contacts were observed between 
nuclei in the two ends. The proximity of the C- and N-terminal positions may be 
significant, since it suggests that the proteolytic processing site that produces the C- 
terminal 96 amino acid fragment may be very close to the GPI membrane attachment site 

20 at or near residue 96. This proximity is consistent with the idea that a membrane-bound 
Plasmodium proteinase is responsible for secondary processing. 

The electrostatic potential surface of the MSP-1 C-terminal fragment is shown in two 
views in Figure 7. The surface in Figure 7a is highly charged, especially in the protruding 

25 loop regions 23 - 27, 35 - 40 and 64 - 66. The surface in Figure 7b contains more neutral 
hydrophilic residues as well as a small hydrophobic patch from Pro85 - Phe87 near the 
center of the surface. In the future, such information could assist in understanding how 
these different surfaces may be involved in interactions with the rest of the MSP-1 
precursor, the processing proteinase, other proteins on the merozoite surface, or unknown 

30 targets on the erythrocyte or parasite vacuolar membrane surfaces. 
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Primary sequence conservation 

The residues involved in the hydrophobic domain interface in P. falciparum are also 
shown in Figure 1, together with corresponding residues in MSP-1 of the less virulent 

5 human malaria parasite, P. vivax (Del Portillo et al., 1991; Gibson et al., 1992). 
Extensive conservation of the interface residues (with conservative substitutions) suggests 
that P. vivax and perhaps other Plasmodium species as well, may have a similar U-shaped 
EGF module pair arrangement. Another feature of the P. vivax sequence, also seen in 
other Plasmodium species, is the single disulphide bond deficiency in the first EGF-like 

10 domain resulting from the absence of cysteine residues equivalent to the P. falciparum 
Cysl2 and Cys28. 

P. falciparum dimorphic sites 

15 Five dimorphic sites have been observed in the P. falciparum MSP-1 19 C-terminal 
fragment from different isolates (Qari et al., 1998). Several observations can be made 
about the position of these sites on the MSP-1 structure. Two sites, Glnl4/Glul4 and 
Lys61/Thr61, involve residues in relatively well-structured backbone regions, with 
surface-exposed hydrophilic or charged side-chains. A pair of adjacent sites, with the 

20 sequence variants Asn70 - Gly71/Ser70 - Arg71, occurs in the disordered loop of domain- 
2, within a segment (residues 68 - 71) that has been shown to be highly mobile. The 
region from Glu65 to Lys73 also appears to be the most variable region among different 
Plasmodium species (Daly et al, 1992; Holder et al., 1992). Finally, the fifth site has a 
substitution between hydrophobic residues (Leu86/Phe86). This partially-exposed side- 

25 chain is located at the hydrophobic domain interface, and the conservative substitution is 
consistent with a role in this interaction. 

Example 2 - Mutation and Monoclonal Antibody Binding studies 

30 As a step towards understanding antibody interactions with the MSP-1 C-terminal 
fragment, the effect of engineered point mutations (within domain- 1) on antibody binding 
has been studied. Amino acid substitutions were made that consisted of radical changes. 
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These radical changes consisted of, for example, replacing an aliphatic residue with a 
charged polar residue, replacing a positively charged side chain with a negatively charged 
side chain, replacing an amino acid with a large side chain with an amino acid with a 
smaller or no side chain (glycine), replacing a polar amino acid with a charged polar 
5 amino acid, replacing a polar amino acid with an aromatic amino acid, replacing a large 
aromatic amino acid with an amino acid with a small side chain, and replacing cysteine 
residues that are involved in disulphide bonds. 

Four individual amino acid substitutions shown in Figure 8, each completely abolish 
10 binding of one or more mAbs to the mutant fragment, as detected by Western blotting. 
The Glu26 mutation, shown in cyan, is closest to the N-terminal proteolytic processing 
site (magenta) at Asnl, and is the only one of this group of mutations that affects binding 
of a processing-inhibitory antibody, i.e. one that is capable of preventing both proteolytic 
processing of the MSP-1 precursor and erythrocyte invasion in vitro. The other three 
15 mutations abolish binding of blocking antibodies that bind to the native C-terminal 
fragment and interfere with the binding of processing-inhibitory antibodies. 

Additional mutations were made based on the immunochemical analyses and the tertiary 
structure of the molecule, and the binding of the mAbs was assessed by western blotting 
20 and BIAcore analysis. The results are summarised in Table 2. The results of the binding 
of selected mAbs to the modified proteins as detected by Western blotting are shown in 
Figure 9, and by BIAcore analysis in Figure 10. Some individual amino acid changes have 
no effect on the binding of any of the mAbs tested (for example Leu22 to Arg). Other 
substitutions affect the binding of one or more mAbs. 

25 

Of particular interest are those changes that prevent the binding of blocking antibodies but 
have no effect on the binding of the inhibitory antibodies. For example, replacement of 
Asnl 5 by Arg prevents the binding of mAb 7.5, replacement of Glu27 by Tyr prevents the 
binding of mAb 2.2, replacement of Leu31 by Arg prevents the binding of mAb 1E1, 
30 replacement of Tyr34 by Ser prevents the binding of mAb 7.5, and replacement of Glu43 
by Leu prevents the binding of mAb 1 1 1 .4. 
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Several combinations of substitutions that prevent the binding of blocking antibodies but 
do not affect the binding of inhibitory antibodies were made in single proteins (Table 2 
and Figure 1 1). In the first Glu27->Tyr, Leu31->Arg and Glu43-*Leu were combined, in 
the second Glu27->Tyr, Leu31-»Arg, Tyr34->Ser, and Glu43->Leu were combined, and 
5 the third Asnl5-»Arg, Glu27->Tyr, Leu31->Arg and Glu43->Leu were combined. None 
of these modified proteins bound any of the blocking antibodies but continued to bind the 
inhibitory antibodies. We propose that the mutant proteins will induce a polyclonal 
response that is more inhibitory than that induced by the wild type protein. 

10 The modified recombinant proteins will also be used to affinity select antibodies from 
pooled serum from individuals exposed to malaria. We hypothesise that the modified 
proteins will select less blocking antibody than the wild type protein and that therefore 
these selected antibodies will be more effective in inhibiting parasite invasion in vitro and 
secondary processing. 

15 

In the first EGF-like domain of MSP-1 from the rodent, primate and P. vivax malaria 
parasites, cysteines 2 and 4 are not present. We have replaced this cysteine pair (Cysl2 and 
Cys28) in the P. falciparum protein. This does not have appear to have any effect on the 
binding of any of the inhibitory antibodies, but does abolish the binding of the blocking 

20 antibody mAb 2.2. We propose that one reason why the proteins from these other malaria 
parasites are more immunogenic is that T cell recognition is more effective or that 
processing by antigen processing cells proceeds by a different degradation pathway that 
drives the fine specificity of the antibody response in a more productive direction (see for 
example Egan et aL, 1997). Removal of the cysteine pair may improve the 

25 immunogenicity of the modified protein and this will be assessed by measuring the level 
of antibodies induced by the P. falciparum protein without the two cysteines with the level 
of antibodies induced by the wild type protein. 
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TABLE A 

u 

# 13-10-98 

U merozoite surface protein- 1 (MSP-1) Plasmodium falciparum (C-terminal fragment) 

# Reference: 'H : DSS=0.000 dioxane=3.755 (internal) 
U ,5 N: indirect 

# ,3 C: indirect 

U 25C pH6.5 50mMNaPO4 lOOmM NaCI 90%H2O/10%D2O 

# —FORMAT— 

# BioMagResBank 
# 

# The original sequence entered was: 
# 

NISQHQCVKKQCPQNSGCFRHLDEREECKCLLNYKQEGDKCVENPNPTCNENNGGCDADAKCTEEDSGSNGK 

KITCECTKPDSYPLFDGIFCSSSN 

H 

# Expressed in N MR- STAR, this sequence is: 
_Mol_residue_sequence 

NI SQHQCVKKQC PQNSGCFR 
HLDEREECKCLLNYKQEGDK 
CVENPNPTCNENNGGCDADA 
KCTEEDSGSNGKKITCECTK 
PDSYPLFDGIFCSSSN 

l°°P_ 

_Residue_seq_code 
_Residue_author_seq_code 
Residue label 



1 


0 


ASN 


2 


Q 


ILE 


3 


0 


SER 


4 


0 


GLN 


5 


0 


HIS 


6 


0 


GLN 


7 


0 


CYS 


8 


0 


VAL 


9 


0 


LYS 


10 


0 


LYS 


11 


0 


GLN 


12 


0 


CYS 


13 


0 


PRO 


14 


0 


GLN 


15 


0 


ASN 


16 


0 


SER 


17 


0 


GLY 


18 


0 


CYS 


19 


0 


PHE 


20 


0 


ARG 


21 


0 


HIS 


22 


0 


LEU 


23 


0 


ASP 


24 


0 


GLU 


25 


0 


ARG 


26 


0 


GLU 


27 


0 


GLU 


28 


0 


CYS 


29 


0 


LYS 


30 


0 


CYS 


31 


0 


LEU 


32 


0 


LEU 


33 


0 


ASN 


34 


0 


TYR 


35 


0 


LYS 


36 


0 


GLN 


37 


0 


GLU 


38 


0 


GLY 


39 


0 


ASP 


40 


0 


LYS 


41 


0 


CYS 


42 


0 


VAL 


43 


0 


GLU 


44 


0 


ASN 


45 


0 


PRO 


46 


0 


ASN 


47 


0 


PRO 


48 


0 


THR 


49 


0 


CYS 


50 


0 


ASN 


51 


0 


GLU 


52 


0 


ASN 


53 


0 


ASN 


54 


0 


GLY 


55 


0 


GLY 


56 


0 


CYS 


57 


0 


ASP 


58 


0 


ALA 


59 


0 


ASP 


60 


0 


ALA 


61 


0 


LYS 


62 


0 


CYS 


63 


0 


THR 


64 


0 


GLU 


65 


0 


GLU 


66 


0 


ASP 


67 


0 


SER 


68 


0 


GLY 


69 


0 


SER 


70 


0 


ASN 


71 


0 


GLY 


72 


0 


LYS 


73 


0 


LYS 


74 


0 


ILE 


75 


0 


THR 


76 


0 


CYS 


77 


0 


GLU 


78 


0 


CYS 


79 


0 


THR 


80 


0 


LYS 


81 


0 


PRO 


82 


0 


ASP 


83 


0 


SER 


84 


0 


TYR 


85 


0 


PRO 


86 


0 


LEU 


87 


0 


PHE 


88 


0 


ASP 


89 


0 


GLY 


90 


0 


ILE 


91 


0 


PHE 


92 


0 


CYS 


93 


0 


SER 


94 


0 


SER 


95 


0 


SER 



96 0 ASN 
stop_ 
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# 

It 


Chemical Shift Ambiguity Code Definitions 


u 

ft 

tt 


ff 

# 

it 


Codes 


Definition 


tt 

# 




1 


Unique 


a 

ff 


# 


2 


Ambiguity of geminal atoms or geminal methyl 


u 
ft 






proton groups 


ft 


# 


3 


Aromatic atoms on opposite sides of the ring# 








(e.g. Tyr HE1 and HE2 protons) 




# 


4 


Intraresidue ambiguities (e.g. Lys HG and 


n 


ff" 




HD protons) 


# 


# 


5 


Interresidue ambiguities (Lys 12 vs. Lys 27) 


# 




9 


Ambiguous, specific ambiguity not defined 


tt 


# 




# 



########################################################## 



# INSTRUCTIONS 

# 1) Replace the @-signs with appropriate values. 

# 2) Text comments concerning the assignments can be supplied in the full deposition. 

# 3) Feel free to add or delete rows to the table as needed. 

# The row numbers (_Atom_shift_assign_ID values) will be re-assigned to sequential values by BMRB 

# # The atom table chosen for this sequence is: 
loop_ 

_Atom_shift_assign_ID 
_Residue_seq_code 
_Residue_label 
_Atom_name 
Atomtype 
_Chem_shift_value 
_Chem_shift_value_error 
_Chem_shift_ambiguity_code 

# 

Table Al. Supplementary: 1H, 13C and 15N chemical shift assignments of MSP-1 
C-terminal fragment 

Atom Residue 

shift Seq Residue Atom Atom Shift/ Error/ Ambiguity- 

assign no. Name Name Type ppm ppm Code 



1 




ASN 


H 


H 


8.29 


0.02 


1 


2 




ASN 


HA 


H 


4.60 


0.02 


1 


3 




ASN 


HB2 


H 


2.86 


0.02 


2 


4 




ASN 


HB3 


H 


2.75 


0.02 


2 


5 




ASN 


HD21 


H 


? 


9 


7 


6 




ASN 


HD22 


H 


? 


? 


? 


7 




ASN 


C 


C 


9 


? 


? 


8 




ASN 


CA 


C 


55.5 


0.6 


1 


9 




ASN 


CB 


C 


40.9 


0.6 


1 


11 




ASN 


N 


N 


125.8 


0.3 


I 


12 




ASN 


ND2 


N 


? 


? 


? 


13 


2 


ILE 


H 


H 


8.29 


0.02 


1 
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1 A 

14 


-> 


lit? 

ILE 


If A 

HA 


15 


2 


ft XT 

ILE 


Lin 


16 


2 


ii f 
ILE 


rlu \jL 


17 


2 


it r* 

ILE 


LI/" 1 1 1 

HO 1 J 


1 o 

IS 


2 


TIC 

ILE 


urn 
nOZ 


19 




lit? 

ILL 


HUI 


20 




ILE 


C 


2 1 


2 


II c 

ILE 


OA 


22 


2 


it r* 

ILE 


IB 


23 


2 


ii t? 
ILE 


CO 1 


24 


2 


Hi? 
ILE 


C02 


25 


2 


ii n* 

ILE 


tu I 


26 


2 


ILE 


VI 

N 


27 


3 


SER 


H 


28 


3 


SER 


U A 

HA 


29 


3 


SER 


urn 

HB2 


30 


3 


SER 




32 


3 


SER 


c 


33 


3 


SER 


CA 


34 


3 


SER 


CB 


35 


3 


SER 


XI 

N 


36 


4 


GLN 


H 


37 


4 


GLN 


ir A 

HA 


38 


4 


GLN 


HB2 


39 


4 


GLN 


HB3 


40 


4 


GLN 


HG2 


41 


4 


GLN 


HGj 


42 


4 


GLN 


HE21 


43 


4 


GLN 


LI Ml 

HE22 


44 


4 


GLN 


C 


45 


4 


GLN 


CA 


46 


4 


GLN 


CB 


47 


4 


GLN 


Co 


49 


4 


GLN 


N 


50 


4 


GLN 


NE2 


51 


5 


HIS 


H 


52 


5 


HIS 


t r a 

HA 


53 


5 


HIS 


HB2 


54 


5 


HIS 


HBj 


56 


5 


HIS 


HD2 


57 


5 


HIS 


HE1 


59 


5 


HIS 


C 


60 


5 


HIS 


CA 


61 


5 


HIS 


CB 


65 


5 


HIS 


N 


68 


6 


GLN 


H 


69 


6 


GLN 


HA 


70 


6 


GLN 


HB2 


71 


6 


GLN 


HB3 


72 


6 


GLN 


HG2 


73 


6 


GLN 


HG3 


74 


6 


GLN 


HE21 


75 


6 


GLN 


HE22 


76 


6 


GLN 


C 


77 


6 


GLN 


CA 


78 


6 


GLN 


CB 


79 


6 


GLN 


CG 


81 


6 


GLN 


N 
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50- 








H 


4.25 


0.02 




H 


1.97 


0.02 




H 


1.39 


0.02 




H 


1.19 


0.02 




H 


0.92 


0.02 




H 


0.81 


0.02 




C 


173.8 


0.6 




C 


62.2 


0.6 




C 


38.7 


0.6 




C 


27.5 


0.6 




C 


18.2 


0.6 




C 


13.7 


0.6 




N 


121.1 


0.3 




H 


8.47 


0.02 




H 


4.20 


0.02 




H 


3.90 


0.02 




H 


3.90 


0.02 




C 


7 






c 


60.9 


0.6 




c 


63.3 


0.6 




N 


119.3 


0.3 




H 


8.32 


0.02 




H 


4.02 


0.02 




H 


1.88 


0.02 




H 


1.88 
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stop_ 

U The following loop is used to define sets of Atom-shift assignment IDs that represent related ambiguous 
assignments taken from the above list of assigned chemical shifts. Each element in the set should be 
separated by a comma, as shown in the example below, and is the assignment ID for a chemical shift 
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assignment that has been given as ambiguity code of 4 or 5. Each set indicates that the observed chemical 
shifts are related to the defined atoms, but have not been assigned uniquely to a specific atom in the set. 

loop_ 

_Atom_shiftassign_lD_ambiguity 

# 

# Sets of Atom-shift Assignment Ambiguities 

U 

# Example: 5,4,7 
# 

@ 

stop_ 
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m — REMARKS — M 

umnmmmumnmnmmmmnmmmmnnmmunummnmumnmununmummmuum 
# 

# PROTECTED BACKBONE AMIDE GROUPS 

U (SLOWLY EXCHANGrNG IN D20) FOR RESIDUES: 

# GLY 17 , PHE 19 , GLU 27 , LYS 29 , LEU 31 , 

# TYR 34 , LYS 35 , VAL 42 , CYS 56 , ASP 57 , 

# ALA 60 , LYS 61 , THR 63 , THR 75 , GLU 77 , 
U LEU 86 , GLY 89 , ILE 90 , PHE 91 

# 

U BROAD HN SIGNALS IN [15-NJ-HSQC OBSERVED FOR RESIDUES: 
U VAL 8 , LYS 9 , LYS 10 , CYS 18 , ARG 20 

n 

U TWO BACKBONE HN CROSSPEAKS OBSERVED FOR RESIDUES: 
U HIS 5 : 7.78,1 13.8/ 7.74,1 13.5 
n GLN 6 : 7.44,122.6 / 7.40,122.4 
# 

# TWO AVERAGED NHVHH* SIGNALS OBSERVED FOR RESIDUES: 

# ARG 20 , ARG 25 : NOT SPECIFICALLY ASSIGNED 
U TO INDIVIDUAL ARGININES 

# 

# LYS 29 NZmZ* SIGNAL: TENTATIVELY ASSIGNED TO 
U LYS 29 (BURIED LYSINE SIDE CHAIN) 

U BASED ON GREATER PROTECTION FROM H20 EXCHANGE 

# THAN OTHER LYSINE NZ/HZ* SIGNALS 
# 

# ASPARAGINE SIDE CHAIN AMIDE SIGNALS: 

# PROBABLE OVERLAPPING CROSSPEAKS -1 12 PPM [ ,5 N] 
U FOR ASN 1 , ASN 70 , ASN 96 

n 

# 

u # 
mimmm^nfmimmmimmmmmmmmmimm^ 
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TABLE B 



merozoite surface protein-1 (MSP-1) P. falciparum C-terminal fragment 

X-PLOR format 

09-11-98 

noes + roes --approximate-- 570 total 

long-range 185 
medium-range 90 
sequential 222 
intraresidue 73 

hydrogen__bonds 10 (20 restraints) 

pseudoatom corrections not used — for R-6 averaging/summation 
AVERAGING: 
class rt6s SUM 
class nsam SUM 



class sing 
class hbnd 
types : 

arom_ aromatic pair 

metn _ methyl 

dgnm_ degenerate methylene 

nsam_ non-stereospecif ically-assigned methylene 

sing_ single proton 

_1 long-range (j-i > 4) 

_m medium-range (j~i =2-4) 

_s sequential (j-i = 1) 

_i intraresidue 

hbnd hydrogen_bonds 



! <residue-atom 1> <residue-atom 2> <dist-minus-plus> 

<type> 

class rt6s 



assign 


(resid 


5 


and 


name 


hb#) 


(resid 


19 


and 


name 


hd#) 


3. 


6 


3. 


6 


0. 


0 


! arom_ 


1 


assign 


(resid 


5 


and 


name 


hb#) 


( resid 


19 


and 


name 


he#) 


3. 


6 


3. 


6 


0. 


0 


! arom_ 


1 


assign 


(resid 


15 


and 


name 


ha) 


(resid 


34 


and 


name 


hd#) 


3. 


6 


3. 


6 


0. 


0 


! arom" 


"l 


assign 


(resid 


15 


and 


name 


ha) 


(resid 


34 


and 


name 


he#) 


3. 


6 


3. 


6 


0. 


0 


! arom_ 


"l 


assign 


(resid 


17 


and 


name 


ha#) 


(resid 


87 


and 


name 


hd#) 


5. 


5 


5. 


5 


0. 


0 


! arom_ 


"l 


assign 


(resid 


17 


and 


name 


hat) 


( resid 


87 


and 


name 


he#) 


5. 


5 


5. 


5 


0. 


0 


! arom" 


~1 


assign 


(resid 


19 


and 


name 


hd#) 


( resid 


20 


and 


name 


hn) 


5. 


5 


5. 


5 


0. 


0 


! arom_ 


_s 


assign 


(resid 


19 


and 


name 


hd#) 


( resid 


21 


and 


name 


hn) 


5. 


5 


5. 


5 


0. 


0 


! arom_ 


s 


assign 


(resid 


19 


and 


name 


hd#) 


( resid 


21 


and 


name 


hd2) 


3. 


6 


3. 


6 


0. 


0 


! arom 


_m 


assign 


( resid 


19 


and 


name 


hd#) 


( resid 


22 


and 


name 


hd#) 


3. 


6 


3. 


6 


0. 


0 


! arom_ 


m 


assign 


(resid 


19 


and 


name 


hd#) 


( resid 


86 


and 


name 


hg) 


5. 


5 


5. 


5 


0. 


0 


! a-rom 


"1 


assign 


(resid 


19 


and 


name 


hd#) 


( resid 


91 


and 


name 


hb#) 


3. 


6 


3. 


6 


0. 


0 


! arom_ 


"l 


assign 


(resid 


19 


and 


name 


hd#) 


( resid 


91 


and 


name 


hd#) 


3. 


6 


3. 


6 


0. 


0 


! arom_ 


"l 


assign 


( resid 


19 


and 


name 


hd#) 


(resid 


91 


and 


name 


he#) 


5. 


5 


5. 


5 


0. 


0 


! arom 


"1 


assign 


(resid 


19 


and 


name 


he#) 


(resid 


21 


and 


name 


hd2) 


5. 


5 


5. 


5 


0. 


0 


! arom_ 


_m 


assign 


(resid 


19 


and 


name 


he#) 


(resid 


22 


and 


name 


hd#) 


3. 


6 


3. 


6 


0. 


0 


! arom] 


m 


assign 


(resid 


19 


and 


name 


he#) 


( resid 


86 


and 


name 


hg) 


5. 


5 


5. 


5 


0. 


0 


! arom_ 


"l 


assign 


(resid 


19 


and 


name 


he#) 


(resid 


91 


and 


name 


hb#) 


3. 


6 


3. 


6 


0. 


0 


! arom" 


"1 


assign 


(resid 


19 


and 


name 


he#) 


(resid 


91 


and 


name 


hd#) 


5. 


5 


5. 


5 


0. 


0 


! arom 


[l 


assign 


( resid 


31 


and 


name 


hn) 


( resid 


34 


and 


name 


hd#) 


3. 


6 


3. 


6 


0. 


0 


! arom 


m 


assign 


(resid 


31 


and 


name 


hg) 


( resid 


34 


and 


name 


hd#) 


5. 


5 


5. 


5 


0. 


0 


! arom 


[m 


assign 


(resid 


31 


and 


name 


hd#) 


(resid 


34 


and 


name 


hd#) 


5. 


5 


5. 


5 


0. 


0 


! arom" 


_m 


assign 


(resid 


31 


and 


name 


hdl#) 


( resid 


34 


and 


name 


he#) 


5. 


5 


5. 


5 


0. 


0 


! arom" 


m 


assign 


(resid 


31 


and 


name 


hd2#) 


(resid 


87 


and 


name 


hd#) 


2. 


8 


2. 


8 


0. 


0 


! arom_ 


"l 


assign 


( resid 


32 


and 


name 


hn) 


(resid 


34 


and 


name 


hd#) 


5. 


5 


5. 


5 


0. 


0 


! arom" 


"m 


assign 


(resid 


34 


and 


name 


hn) . 


(resid 


34 


and 


name 


hd#) 


3. 


6 


3. 


6 


0. 


0 


! arom" 


> 


assign 


(resid 


34 


and 


name 


ha) 


( resid 


34 


and 


name 


hd#) 


3. 


6 


3. 


6 


0. 


0 


! arom_ 


_i 


assign 


(resid 


34 


and 


name 


hd8) 


(resid 


35 


and 


name 


hn) 


3. 


6 


3. 


6 


0. 


0 


! arom_ 


s 



<WO 0063245A3_IA> 



WO 00/063245 



PCT/GB00/01558 



^ o o i on 


( r €S id 


34 


and 


name 


hd# ) 


( resid 


a c c \ an 


( res id 


34 


and 


name 


hd# ) 


( resid 




( re s id 


34 


and 


name 


he# ) 


( resid 


2 c c i on 


( resid 


34 


and 


name 


hd# ) 


( resid 


a c c i on 


( rpciH 

\ -i- CoJLU 


14 


and 


name 


he# ) 

* ic; tx 1 


( res id 


^ c c i on 

a o j XL) 1 1 


( res id 


34 


and 


name 


he# ) 


( resid 


a e c i on 
a Jiiyii 


\ i coiu 


34 


a nd 


name 


licir / 


( res id 


a cci on 


( r6sid 


34 


and 


name 


he# ) 


( res id 


3 c c i on 
a o o x y i i 




8 3 


a nd 


n ame 


hn ) 


( re s id 


-\ 1 fl o 

a b o lyil 


( rnci H 


0 0 


a 1 iu 


n ame 


hn ) 


( res id 


n o i et T"l 


( r e s id 


ft ft 
0 0 


and 


name 


IlU ft } 


( resid 


^ o e -i /"f f*\ 


( r e s id 


ft 


and 


n ame 


nun) 


/ £• q 2 id 


O C? O T /~T 

dssiyii 




ft 


a nH 
a 1 1U 


n amo 


i 1V<1 IT / 


/ rp<; i H 


a e o i nr\ 


V x c jiu 


8 6 


a 1 1U 




1 iu tr / 


( res id 


s c c i nn 
d joiyil 


/ rpc l H 


ft 7 


a n H 


1 1 dill C. 


1 ia / 


( res id 


-a o o "J on 


( r e s i d 


87 


and 


1 1 Cl J LH3 


1 tr / 


( res id 


3 s s x^n 


( r ss id 


87 


and 


name 


hd# ) 


( resid 




( r e s i d 


87 


and 


name 


1 1U ff / 


{ resid 


o <r o i on 
ctooiyu 


( rGsid 


87 


and 


name 


hd# ) 


( res id 


3 <^ o i on 


( re s id 


87 


and 


name 


hd# ) 


{ res id 




( r e s id 


ft 7 


and 


n ame 


l lvJ If ^ 


( Y"0 C 1 H 

V x e a x 




/ y- 0 C i O* 


ft7 


a 1 1U 


j-i o m 0 
1 1 diuc 


I lc rr / 


( res id 


ass ign 


( res id 


ft 7 


and 


name 


ne # ; 


\ I Co IU 


ass ign 


( r es id 


9 1 


and 


name 


hnl 
11x1/ 


\ 1 c 0 1 u 


a s sign 


( res id 


Q1 

J? J. 


and 


name 


hH ft ) 




a s siyn 


( r e s id 


z. 


and 


name 


hH ^ 


/ rPC i H 


a s s iofn 


{ r e s i d 


c; 


and 


name 


nuft / 


\ X C S X*wl 


as siyn 


( r e s id 


1 7 

X / 


and 


name 


1111 / 




as siQn 


( T* A Q "i H 


1 7 

X / 


and 


name 


h^ # ) 
1 id tr ) 


{ resid 


as s xyn 




? Q 


a r\ w 


name 


1 iu tr / 


( resid 


ass iyii 


/ C "i H 


?Q 


a n H 


name 


hat ) 
uy tr ; 


( resid 


3 c 0 i on 
oooiyii 


f rPCi H 

\ i CO 1U 


?9 

<!. 7 


3 n H 
allU 


n a mo 


hH# ) 
1 iu tr ) 


( resid 


dooiyil 


/ roe i H 


?Q 

^. -7 


a n H 


n ama 


1 ic tr / 




assign 


( resid 




and 


name 


hn \ 
nn ) 


{ T* 0 c 1 H 


a s siyn 


( res id 


0 1. 


and 


name 


hA 1 
na ) 




assign 


( resid 


-3 1 
J. 


and 


name 


hA 1 
1 10 ) 


\ X C O XU 


as siyn 


( resid 




and 


name 


hA ^ 
i Id J 


^ r" 0 ci H 


ass iy n 


( resid 


•J 


and 


name 


hHtt ^ 
riutr / 


( r e s x d 


ass ign 


( resid 




and 


name 


hoi 

ng ; 




a c c A nr\ 
dboiyil 


{ resid 


4 fi 


OllU 


name 


hn \ 


f rpci H 
V X coxu 


a c c i n r> 
dooiyu 


{ r q c i H 


4 7 


ailU 


name 


hA ) 

l Id / 


( resid 


assign 


/ T*tf» C 1 H 
\ IcolU 


4 7 


and 


name 


hHI 1 

i IU X J 


^ J.C31U 


a 0 0 i on 
0 0 0 x y 1 1 


( y <a c i H 


4 7 


a nH 


1 LdlllC 


ho# ) 
ny tr ; 


( resid 


a c c 1 on 


f r <a Q i H 


4 8 


a nd 


name 


hn ) 


( resid 


□ c c i n n 
ajoxyii 




4 ft 


0 uu 


name 


hn 1 


ZrPQiH 


ass 1 gn 


( resid 


4 ft 

*i O 


and 


name 


hA \ 
lid J 


^ ICdlU 


3 O C i f~* Y\ 

dooiyil 


( resid 


4 ft 


and 


name 


1 iy tr ; 


/ jjgg id 


assign 


( res id 


4 ft 


and 


name 


ho9# 1 
ng £. tr ; 


( T~ « C 1 H 
^ I C J IU 


assign 


( res id 


4 ft 


and 


name 


ho? ft 1 
1 iy c. tr / 


( resid 


assign 


( resid 


4 ft 


and 


name 


ho? ft 1 
ny c.rt ) 


\ ICO IU 


a c; c S O 


( y-QC i H 
V i CO 1U 


4 ft 


and 


name 


ho? ft \ 
1 iy £,ttt 




a c c i /-t »-» 
oooiytl 


( resid 


4 ft 


and 


name 


ho? ft \ 
ny ^ tr 7 


\ICOlU 


acci on 
aooiyii 


V X coxu 


4 ft 


and 


name 


ho? ft 1 
i iy «. tr / 


( res id 


ass 1 gn 


/ y 0 c i H 
\ IcolU 


4 ft 


and 


name 


ho? ft 1 
ny £• tr j 




assign 


( resid 


4 Q 


and 


name 


hn I 


( resid 


a c c 1 on 
d 0 0 x y 1 1 




4 Q 


and 


name 


hA 1 
na ] 


\ X C S X vi 


ass 1 gn 


( res id 


^4 


and 


name 


hnl 
ill 1 / 


( resid 


assign 


( re s id 


54 


and 


name 


ha# ) 


( resid 


assign 


( resid 


54 


and 


name 


ha#) 


(resid 


assign 


{ resid 


56 


and 


name 


hn) 


(resid 


assign 


(resid 


56 


and 


name 


hn) 


(resid 


assign 


( resid 


56 


and 


name 


ha) 


(resid 


assign 


(resid 


56 


and 


name 


ha) 


(resid 


assign 


(resid 


56 


and 


name 


hbl) 


(resid 


assign 


( resid 


57 


and 


name 


hn) 


(resid 



-74. 



42 


and 


name 


hn) 


5 . 


5 


5 . 


5 


0 


0 


! arom 1 


42 


and 


name 


ha ) 


3 . 


6 


3 . 


6 


0 


0 


! arom 1 


43 


and 


name 


hn ) 


5 . 


5 


5 . 


5 


0 


0 


! arom 1 


43 


and 


name 


hb#) 


5 . 


5 


5 . 


5 


0 


0 


•arom 1 


42 


and 


name 


hn) 


5 . 


5 


5 . 


5 


0 


0 


! arom 1 


42 


and 


name 


ha) 


5 . 


5 


5 . 


5 


0 


0 


! arom 1 


43 


and 


name 


hb#) 


5 . 


5 


5 . 


5 


0. 


0 


! arom 1 


43 


and 


name 


ha# ) 


3 . 


6 


3 . 


6 


0 


0 


! arom 1 


84 


and 


name 


hd# ) 


3 . 




3 . 


6 


0 . 


0 


j arom s 


91 


and 


name 


hd# ) 


5 . 


5 


5 . 


5 


0 . 


0 


! arom 1 


87 


and 


name 


hd# ) 


2 . 


8 


2 . 


8 


0 . 


0 


1 arom s 


87 


a nd 


name 


he# ) 


3 . 




3 . 




0 


0 


1 a rom c 


87 


a nd 


name 


hd# ) 


3 . 


6 


3 . 


6 


0 


0 


! arom s 


87 


and 


name 


he# ) 


3 . 




3 m 


6 


0 


0 


1 a rom s 


87 


and 


name 


hd# ) 


2 . 


8 


2 . 


8 


0 


0 


! arom i 


90 


and 


name 


hb) 


5 . 


5 


5 . 


5 


0 


0 


1 arom m 


90 


and 


name 


hg2#) 


5 . 


5 


5 . 


5 


0 . 


0 


! arom m 


90 


a nd 


name 


hd# ) 


3 . 


6 


3 . 


6 


0 . 


0 


1 arom m 


91 


a nd 


name 


hb# ) 


3 . 


6 


3 . 


6 


0 . 


0 


! arom m 


91 


a nd 


name 


hd# ) 


2 . 


8 


2 . 


8 


0 . 


0 


1 arom m 


Z? X 


and 


name 


h^ft ) 


3 . 




3 . 




0 . 


0 


1 a rnm m 

. C* A- ^x/lll ill 


91 


and 


name 


hdft ) 

1 ivj tr / 


5 . 


5 


5 . 


5 


0 . 


0 


1 a r*om m 




a n H 


n Amp 


heft ) 
t ic tr / 


•j • 




s 

~j • 


— ' 


0 


0 


1 a rom m 

. ai win 111 


^ X 


and 


name 


hd# ) 


j ■ 


a 


3 




0 . 


0 


! arom i 


Q? 


d 1 iu 


name 


hn ) 


~j • 








0 . 


0 


* a mm ^ 
. 0 win 0 


ft 


anH 
d 1 lvJ 


name 


hdft ) 


5 . 


5 


5 m 


5 


0 . 


0 


1 nieth 1 


ft 6 


and 


name 




s 

• 


5 


5 . 


5 


0 . 


0 


! meth _ 1 


90 


and 


name 


hdft ) 
1 'u tr / 


4 m 


\ 


4 


x 


0 . 


0 


I me t h 1 


90 

^ *j 


and 


1 1 d It LC 
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5 t 


5 


5 . 


5 
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0 


i nie t h .1 




a nd 
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hd# ) 


5 . 


5 


5 . 


5 


0 . 


0 


! meth 1 


90 
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name 


hd# ) 


5 . 


5 


5 . 


5 


0 . 


0 


1 me t h 1 
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hdft ) 


5 . 


5 


5 . 


5 


0 . 


0 


! meth 1 


90 


anH 
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hdft ) 

l IV-l tr J 




s 


S 
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n 


n 


1 m e t h _ 1 


90 


d 1 ivx 


name 


hdft ) 
i iv4 tr j 




5 


5 . 




0 . 


0 

yJ 


! methyl 


90 


anH 


name 


ho?ft 1 
1 iy 4C tr } 


5 . 




5 . 


s 


0 . 


0 


I JT\Q t h 1 


90 


anH 
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name 
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■j • 


«j 
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90 


and 
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5 
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5 
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0 
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90 
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5 


5 . 
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! meth 1 


90 
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5 . 


5 


5 . 
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1 meth 1 


40 
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heft ) 
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5 . 


5 


5 . 


5 


0 . 


0 


! met h i 


48 
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name 
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5 . 


5 


5 . 


5 


0 . 


0 


i meth s 


74 


and 


name 


hd# ) 


5 . 


5 


5 . 


5 


0 . 


0 


1 methyl 


74 
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hd# ) 


5 . 


5 


5 . 


5 


0 . 


0 


1 me t h 1 


4ft 


and 


name 
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3 . 


5 


0 . 


0 
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74 
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name 
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5 . 


5 


5 . 


s 


0 


0 
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3 . 


6 
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0 


1 me t h X 


4 9 
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hn ) 


5 . 


5 
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VJ 


n 
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5 
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5 
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0 
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i 

X 


0 . 


0 
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. HI C L. 1 I 111 


~J X 
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3 . 




3 . 


6 


0 . 


0 


l met" h m 
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J X 


anH 
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name 
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3 


6 

w 


3 


\j 


n 


0 


. UlC V. 1 I 111 
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name 
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5 


5 . 


5 


0 . 


0 
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74 
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name 
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5 . 
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5 . 


5 


0 . 


0 


! meth 1 
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hd# ) 


5 . 
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5 . 
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0 . 
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! meth 1 
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5 . 


5 


0 . 
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! me t h 1 
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0 


p me t h 1 


74 


and 


name 
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5 . 


5 


0 . 


0 


! meth i 


74 
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5. 


5 


5! 


5 


0. 
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5. 


5 


0. 
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5. 
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0. 
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0. 
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5. 
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0. 
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60 
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6 
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assign 


( res id 


fi 7 
3 I 


and 


name 


na j 


( resid 


60 


and 


name 


hhU \ 




c. 


-j . 




fi 
w . 


o 


1 rnp tr h m 


assign 


( res id 


fi7 
3 / 


and 


name 


nDtr ) 


(res id 


60 


and 


name 


1 IU tf J 






s 


5 


fi 


o 


i t* h m 

> ILIC^ V, 1 1 III 


assign 


( res id 


fiQ 
3 0 


a nd 


name 


hh tt \ 
nD it } 


( resid 


sg 


aliu 


name 


hn ) 




fi 


~i 

~j . 


fi 


fi 


o 


1 mp t h ^ 

. Ill W U 1 1 o 


assign 


( res id 


3 O 


a nd 


name 


hHJt \ 

riDff j 


( res id 


fin 


a llKJ. 


n amp 
1 1 a site 


h n ) 
1111/ 


•J . 


c: 
3 


«j . 


c. 


fi 

VJ . 


fi 


1 mp t~ h m 

. IUC V_ 1 1 tu 


assign 


( res id 


fi Q 

3 y 


and 


name 


nn ) 


( res id 


fin 


a I iu 




I IU tr / 




c; 

3 


3 . 


fi 

3 


fi 


fi 

VJ 


. lllc L- 1 1 o 


assign 


( res id 


^n 


and 


name 


hhtt \ 

nDtf j 


(res id 


fil 


and 


name 


hn ) 
iiii/ 


~j • 


fi 
o 




fi 


fi 


o 


! me t h s 


assign 


( res id 


fin 
ou 


a nd 


name 




(res id 


1 fi 


a 1 lv-l 


n amp 
1 1 cult e 


h^ \ 
i la f 


c, 

j . 


■J 


«y . 


fi 


fi 

\J • 


fi 

VJ 


i fliQ t h i 


assign 


( res id 


fin 
ou 


and 


name 


no ft j 


( resid 


7 fi 
/ o 


and 


name 


hhl \ 


c; 


c: 
3 


c: 

3 . 


fi 

3 


fi 
u . 


fi 


. nit; Lit x 


assign 


( res id 


fin 
ou 


and 


name 


u u a \ 
nDf ) 


( res id 


77 


and 


name 


hn \ 
nn ) 


-i 

o - 


fi 

o 


"3 

3 . 


fi 


fi 

U • 


fi 

VJ 


\ me t h i 


assign 


( res id 


fin 


and 


name 


nDw ; 


( resid 


77 


and 


name 


1 id / 




c: 

3 


3 . 


fi 

3 


fi 
VJ . 


fi 

VJ 


. 1UC L. 1 1 J. 


assign 


( re-s id 


fin 


and 


name 


hKtl \ 

nDtf ; 


( res id 


7 fi 


and 


name 


iiii ; 


~J ■ 


3 


3 . 


fi 
3 


n 


fi 

VJ 


t m o t" h 1 

. tltC V_ 1 1 ~L 


assign 


( res id 


fin 
OU 
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name 


now ) 


( res id 


7ft 
f o 


and 


name 


hhl \ 
no i ) 




Q 
3 


c 

3 . 


fi 
3 


n 


fi 

VJ 


! me t h 1 


assign 


{ res id 


fiO 
OZ 


and 


name 


na j 


( resid 


7 A 
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name 


hrr9 \ 
ngz ft ) 


o . 


c; 

3 


fi 

3 . 


fi 

3 


fi 
u . 


A 
VJ 


I me t h i 


assign 


( resid 


fi i 

03 


and 


name 


nn j 


( resid 


7 d 


and 


name 


ngz tf ; 


*t . 


"1 
X 


/I 

*i . 


1 
1 




A 

u 


! me t h 1 


assign 


( resid 


fi *3 


and 


name 


nn ) 


( resid 




and 


name 


naff ; 




O 


3 . 


o 


n 
u . 


fi 

u 


! me t h 1 


assign 


( resid 


C "3 

0 3 


ana 


name 


ngz If ) 


( resid 


fid 
OH 


and 


name 


nn ; 


J . 


0 


3 . 


o 


u . 


A 

u 


I me t h s 


assign 


( resid 


fi A 

o4 


ana 


name 


na ) 


( resid 


7 A 
f H 


and 


name 


h<nrO # \ 

ngz tt ) 


c 

0 . 


3 


c 

3 . 


c 
3 


u . 


fi 
u 


! me t h 1 


assign 


(resid 


C A 

64 


and 


name 


na ) 


( resid 


1 A 


and 


name 


naif ) 


c 

3 . 


3 


3 . 


c 
3 


n 
\J . 


A 

u 


! methyl 


assign 


( resid 


C A 

o4 


_ _j 
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name 


now ) 


( resid 


7 A 
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name 


naff ; 


c 

0 . 


c 
3 


c 

3 . 


c 
3 




A 

u 


! me t h 1 


assign 


( resid 


fi A 

04 


ana 


name 


ngff ) 


( resid 


7 A 


and 


name 


natf ) 


c 

3 . 


3 


3 . 


3 


n 


fi 

VJ 


! me t h 1 


assign 


{ resid 


f z 


ana 
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nn ) 
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7 9 
/ Z 


and 
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neff ; 


-5 

J> . 


o 


3 . 


O 


n 
u . 


fi 
u 


! me t h i 


ass ign 


(resid 


/ Z 


_ _ _j 
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nDtT ; 
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c: 

3 • 
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3 


C 

3 . 


fi 

3 


n 
u . 


fi 


1 m a +• h m 
; lllc U 11 HI 


as sign 


( resid 


1 3 
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name 
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( res id 
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and 
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nut ) 
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o • 


c: 


c: 

3 • 


fi 


n 


A 
VJ 


! me t h s 


assign 


( resid 


1 A 

1 4 
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name 


nn ) 
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7 A 
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ngz ft j 


o « 


fi 
o 


-3 

3 . 


fi 


fi 

u . 


A 
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! me t h i 


assign 
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1 A 

1 4 
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name 
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7 A 


and 


name 


ngz ff ; 




o 


r> 

3 . 


o 


fi 
u . 


fi 


! me t h i 


assign 


( resid 


1 A 

1 4 


and 


name 


na ; 


( resid 


7 A 


and 


name 


naff ) 


-> 


0 


-3 

3 . 


o 


fi 
u . 


fi 


! me t h i 


assign 


( resid 


1 A 

1 4 
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name 
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ngz ff j 
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7 ^ 
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na / 


c 

D . 


c 
3 


c 

3 . 
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3 
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u . 


fi 

VJ 


! me t h s 


assign 
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/ 3 
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name 


ngz tf ) 


( res id 
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h n \ 

nn ) 


-3 

3 . 


fi 
O 


3 . 


o 


fi 


fi 


! me t h s 
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( resid 


7 3 
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( resid 
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/ y 


and 
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-3 

J . 


0 


-3 
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n 
U . 
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U 
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assign 
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1 3 


and 
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( resid 


O U 


and 


name 
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«J - 


0 


-3 

3 . 


O 


u . 


A 

u 


! me t h s 


assign 


( resid 


8 3 


and 


name 


na ) 


( resid 


Q fi 

o 0 


and 


name 


naff ) 


C 

D . 


3 


3 . 


c 
3 


U . 


A 

u 


! me t h s 


assign 


(resid 


8 6 


and 


name 


nd# ) 


\ resia 


Q "7 


and 


name 


nn ) 


c 

D . 


3 


c 

3 . 


3 


A 
U . 


A 
U 


! met h s 


assign 


( resid 


o c 
0 0 


and 


name 


ndlf ) 


( resid 


Q 1 

y l 


and 


name 


nDtf ; 


c 

0 . 


c 


3 . 


c 


u . 


A 
U 


i me t n_i 


assign 


( resid 


0 0 


and 


name 


na» ) 


( resid 


y z 


and 


name 


na | 


c 

3 . 


c 
3 


c 

3 . 


c 
3 


fi 


A 


! me t h 1 


assign 


( res id 


Q T 

0 / 


and 


name 


na ) 


( resid 


Qfi 

y u 


and 


name 


HH it \ 

naff ) 


c 

3 . 


3 


3 . 


3 


fi 


fi 
U 


: aictn m 


assign 


( resid 


O 1 


ana 


name 


nDtT ; 


( resid 


Qfi 

y u 


and 


name 


ngz ft ; 


c: 

3 . 


c: 
3 


c: 

3 . 


c 
3 


fi 


A 
U 


: irie in in 


assign 


( resid 


o / 


ana 


name 


nDtf ) 


( resid 


on 
y u 


and 


name 


KH it \ 

naff / 


3 . 


C 
3 


fi 

3 . 


fi 
3 


fi 


fi 


! me v_ n in 


assign 


( resid 


o 3 


and 


name 


nn ) 


( resid 


on 
y u 


and 


name 


ngz ff ) 


c 

3 . 


3 


3 . 


3 


fi 


fi 


! me t h s 


assign 


( res id 


yu 


ana 


name 


nn j 


( resid 


y u 


and 


name 


ngz ff j 


3 . 


fi 

o 


3 . 


fi 


fi 


fi 
u 


! met h i 


assign 


( resid 




_ _ _j 
and 


name 


na ; 


( resid 


on 
y u 


and 


name 


ngz ff ; 


3 • 


fi 
o 


3 . 


O 


fi 


fi 
u 


! me t h i 


assign 


( resid 


Q A 


_ _ _j 
ana 


name 


ngz » ) 


( resid 


Q 1 


and 


name 


nn ; 


3 . 


3 


3 . 


3 


fi 


fi 


! me t h s 


assign 


( res id 


on 
yU 


_ _ -j 
and 


name 


natf ) 


( resid 


Q 1 

y i 


and 


name 


nn / 


C 

3 . 


c 
3 


3 . 


fi 
3 


A 

u . 


fi 


: me l. n 


assign 


( resid 


b 


ana 


name 


na j 


( resid 


o 


and 


name 


hn it \ 

ngff ) 


Z . 


Q 
O 


z . 


O 


fi 
u . 


fi 

VJ 


: □vjnm j. 


assign 


( resid 


i n 


ana 


name 


neff ) 


( resid 


jl y 


and 


name 


nn j 


3 . 


o 


3 . 


a 
o 


fi 


A 

u 


: agnm i 


assign 


( resid 


1 A 
10 


ana 


name 


nef ) 


( resid 


z o 


and 


name 


hn \ 

nn ; 


3 . 


o 


■3 

3 . 


O 


fi 


A 

u 


I ^9 Try 1 

: agnm x 


assign 


( resid 


1 A 

1 4 


and 


name 


nn ) 


( res id 


1 A 


and 


name 


ngff j 


3 . 


fi 

o 


3 . 


fi 


fi 


fi 
u 


1 H /"<r r> m i 

. agnm i 


assign 


( resid 


1 A 

1 4 


ana 


name 


l\DW ) 


( resid 


1 ^ 
1 3 


and 


name 


nnj 


/I 


1 


** . 


i 
i 


fi 


fi 
u 


1 H j-t n m c? 

i agnm s 


assign 


{ resid 


9 A 
Z H 


and 


name 


nn/ 


( res id 


7 A 


a nd 


name 


ng» ; 


-5 

_) . 


fi 

D 


J • 


fi 

D 


fi 


fi 

VJ 


1 /4rrnm i 
. Uu 1 1111 X 


ass ign 


( res id 


9 A 
Z H 


and 


name 


K -a \ 

na ; 


( res id 


0 A 


and 


name 


ho E ) 
ngff ) 


9 


O 
O 


9 


p 
O 


n 


A 
VJ 


f H/*mm i 

: uyniu ± 


assign 


/ VAC* <i H 

\ res ia 


Z4 


and 


name 


ngff ) 


( resid 


Z D 


and 


name 


1111 / 


c 

3 • 


c 

3 


3 . 


c: 
3 


n 


A 

u 


: agnm s 


assign 


( resid 


O fi 
Z3 


and 


name 


nn / 


( resid 


Z D 


and 


name 


naff / 


c. 

3 . 


c: 

3 


fi 

3 . 


fi 

3 


fi 


fi 

VJ 


1 H i^r r*k m •? 

: agnm i 


assign 


( resid 


9 fi 
Z 0 


and 


name 


nn ; 


(res id 


9 fi 
z o 


and 


name 


I ID ft ) 


9 

z . 


Q 
0 


9 


p 
O 


fi 


A 


1 H on m I 

* agnm x 


assign 


( resid 


z y 


ana 


name 


nDtf ) 


( resid 


9 Q 

z y 


and 


name 


hHtt \ 

naff / 


3 . 


0 


"3 

3 . 


o 


A 

u . 


A 
U 


! dgnm i 


assign 


( resid 


1 A 

23 


and 


name 


nDtf ) 


( resid 


JU 


and 


name 


nn ; 


-3 

3 . 


0 


3 . 


c 
O 


U . 


rv 
U 


! dgnm^s 


assign 


( resid 


*3 *3 


_ _ _j 
ana 


name 


nn ; 


( resid 


4 / 


and 


name 


h n it \ 

ngff ; 


c 

3 . 


c 
3 


c 

3 . 


3 


A 

U . 


A 

U 


! dgnm 1 


assign 


( resid 


*3 c 


ana 


name 


nn ) 


( resid 


3 D 


and 


name 


h r*i it \ 

ngff j 


■3 

3 . 


0 


-a 

3 . 


O 


A 

u . 


A 
U 


! dgnm_i 


assign 


( resid 




^ — _j 
ana 


name 


na ) 


( resid 


3 !) 


and 


name 


h/~i it \ 

ngff j 


*3 

3 . 


O 


-3 

3 . 


£L 
O 


A 

u . 


A 

u 


! dgnm i 


as s i an 

t» O U X \J 1 1 


(res id 


35 


a nd 


name 


ha# ) 


( resid 


36 


and 


name 


hn) 


3. 


6 


3 . 


6 


0. 


0 


! dgnm s 


assign 


(resid 


35 


and 


name 


hg#) 


( resid 


42 


and 


name 


hn) 


5. 


5 


5. 


5 


0. 


0 


! dgnm_i 


assign 


(resid 


35 


and 


name 


hg#) 


(resid 


44 


and 


name 


ha) 


3. 


6 


3. 


6 


0. 


0 


! dgnm_l 


assign 


(resid 


35 


and 


name 


hg#) 


(resid 


44 


and 


name 


hb#) 


3. 


6 


3. 


6 


0. 


0 


! dgnm_l 


assign 


( resid 


36 


and 


name 


hn) 


( resid 


36 


and 


name 


hg#) 


3. 


6 


3. 


6 


0. 


0 


! dgnm_i 


assign 


(resid 


36 


and 


name 


ha) 


( resid 


36 


and 


name 


hg#) 


2. 


8 


2. 


8 


0. 


0 


! dgnm_i 


assign 


( resid 


36 


and 


name 


hb#) 


(resid 


37 


and 


name 


hn) 


5. 


5 


5. 


5 


0. 


0 


! dgnm_s 


assign 


( resid 


36 


and 


name 


hg#) 


(resid 


37 


and 


name 


hn) 


3. 


6 


3. 


6 


0. 


0 


! dgnm_s 
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assign 


( resid 


36 


and 


name 


hg#) 


( resid 


40 


and 


name 


hn) 


5. 


5 


5. 


5 


0 


.0 


! dgnm_m 


assign 


(resid 


37 


and 


name 


ha) 


(resid 


37 


and 


name 


hg#) 


3. 


6 


3. 


6 


0 


.0 


! dgnm_i 


assign 


( resid 


37 


and 


name 


hg#) 


( resid 


38 


and 


name 


hn) 


4 . 


1 


4 . 


1 


0 


.0 


! dgnm_s 


assign 


( resid 


40 


and 


name 


hn) 


( resid 


40 


and 


name 


hg#) 


3. 


6 


3. 


6 


0 


.0 


! dgnm_i 


assign 


(resid 


40 


and 


name 


hn) 


(resid 


40 


and 


name 


hd#) 


5. 


5 


5. 


5 


0 


.0 


! dgnm_i 


assign 


( resid 


4 4 


and 


name 
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(resid 


45 


and 


name 


hg#) 


3. 


6 


3. 


6 


0 


.0 


! dgnm_s 


assign 


(resid 


45 


and 


name 


hg#) 


(resid 


46 


and 


name 


hn) 


3. 


6 


3. 


6 


0 


.0 


! dgnm_s 


assign 


(resid 


47 


and 


name 


hg#) 


(resid 


48 


and 


name 


hn) 


5. 


5 


5. 


5 


0 


.0 


! dgnm_s 


assign 


( resid 


49 


and 


name 


hn) 


(resid 


51 


and 


name 


hg#) 


3. 


6 


3. 


6 


0 


.0 


! dgnm m 


assign 


( resid 


49 


and 


name 
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-86- 

Example 3: Identification of blocking antibodies using a competitive binding assay 
and immobilised wild type GST-MSP-1 19. 

In previous studies antibodies that blocked the action of the neutralising antibodies 12.8 
5 and 12.10 had been defined either directly in the MSP- 1 42 processing assay (Blackman et 
a/., 1994) in a coupled erythrocyte invasion-MSP-l 4 2 processing assay (Guevara et al t 
1997) or in a competitive radioimmunoassay with merozoite protein as the antigen 
(Guevara et al, 1997). These studies have been extended using recombinant MSP-1 and 
BIAcore analysis. 

10 

A recombinant fusion protein comprising wild type MSP-1 19 fused to GST was coupled to 
the sensor chip and competitor antibody was first allowed to bind to the antigen. Then a 
solution of either mAb 12.8 or 12.10 was passed over the chip and the amount of binding 
of this second antibody was quantified. If the first antibody interferes with the binding of 
15 the second antibody then this is reflected in a reduction in the amount of second antibody 
bound. 



Methods 

The wild type GST-MSP-1 19 was coupled to a CMS sensor chip. The binding assays were 
20 performed with a constant flow rate of 5 \x\ min 1 at 25°C. For binding, purified mAbs 
1E1, 8 A 12 and 2F10 at 100 ^ig ml" 1 in HBS-EP buffer (iOmM HEPES pH7.4 containing 
150mM NaCl, 3mM EDTA and 0.005%v/v polysorbate 20); mAbs 1E8, 9C8, 12D11, 
Hi. 2 and 111.4 in cell culture medium supernatant; mAbs 2.2, 7.5 and 89.1 at 1:10 
dilution of ascitic fluid in HBS-EP buffer; and mouse a-GST antibody at 1 : 1 0 -dilution 
25 serum in HBS-EP buffer were allowed to interact with immobilised wild type GST-MSP- 
1 19 for 10 min. After allowing 5 min for dissociation of low affinity interactions, either 
mAb 12.8 or 12.10 at 100 |ig mf 1 in HBS-EP buffer was added and allowed to bind for 10 
min. After washing the chip for 5 min the binding of 12.8 or 12.10 was measured. The 
chip was regenerated by washing off bound antibody with 10 mM glycine-HCl, pH 2.4, or 
30 when required with 100 mM glycine-HCl, pH 1 .8, for 3 min. 
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Results 

The results are shown in Figure 12. AH the competitor antibodies bind to the GST-MSP- 
1 19 antigen, with the exception of mAb89.1 which is a negative control. As expected, 
5 mAbs 12.8 and 12. 10 competed with each other (Guevara et al t 1997). The other 
antibodies which do not inhibit processing could to greater or lesser interfere with the 
binding of 12.8 and 12.10. As expected from previous studies mAbs 1E1, and 7.5 
blocked both 12.8 and 12.10, whereas 2.2 and 1 1 1 .4 blocked 12.8. Another particularly 
effective blocking antibody identified in this study was mAb9C8. 

10 

Example 4; Immunization of small animals with modified GST-MSP-lio and analysis 
of the antibodies induced 

To determine whether or not the modified proteins were immunogenic, recombinant GST- 
15 MSP- 1 19 fusion proteins were used to raise antibodies by immunisation. 

Methods 

Two modified proteins containing either 3 [27+3 1+43] or 4 [15+27+31+43] amino acid 
substitutions, respectively, were used to immunise rabbits and mice. The rabbits were 
20 immunised subcutaneously with MSP- 1 19 protein in Freund's complete adjuvant and then 
boosted on three occasions with 200 ng of the protein in Freund's incomplete adjuvant 21, 
42 and 63 days later, and serum samples were collected. 

The presence and level of antibodies binding to the native MSP-1 protein in the parasite 
25 was assessed by indirect immunofluorescence using acetone fixed smears of parasite- 
infected erythrocytes. The sera were diluted serially in phosphate buffered saline (PBS) 
and incubated on the slide for 30 min at room temperature. After washing, the slides were 
incubated with FITC conjugated goat anti-rabbit or anti-mouse IgG, washed, and then 
examined by fluorescence microscopy. 

30 

The sera were also analysed in an MSP-1 secondary processing assay. Analysis and 
quantitation of secondary processing of MSP-1 in merozoite preparations was by a 
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modification of an assay described previously (Blackman et ai, 1994). Washed P. 
falciparum 3D7 merozoites were resuspended in ice-cold 50 mM Tris-HCl pH 7.5 
containing 10 mM CaCl 2 and 2 mM MgCb (reaction buffer). Aliquots of about 1 x 10 9 
merozoites were dispensed into 1.5 ml centrifuge tubes on ice, and the parasites pelleted 
5 in a microfiige at 13,000 x g for 2 minutes at 4°C. The supernatant was removed, and 
individual merozoite pellets were then resuspended on ice in 25 |al of reaction buffer 
further supplemented with protease inhibitor or antibodies as appropriate. Merozoites 
were maintained on ice for 20 min to allow antibody binding, then transferred to a 37°C 
water bath for one hour to allow processing to proceed. Assays always included the 

10 following controls: a "positive processing" control sample of merozoites resuspended in 
reaction buffer only; a "negative processing" control sample of merozoites resuspended in 
reaction buffer plus ImM PMSF; and a zero time (Oh) control, in which processing was 
stopped before the 37°C incubation step. The processing was assayed using the western 
blot-based method and by a modified processing assay. Supernatants from the assays 

15 were obtained after centrifugation for 30 min at 4°C, 13,000 x g to remove the insoluble 
material. The amount of MSP- 1 33 in the supernatants was measured using an ELISA 
method. Fifty microlitres of diluted sample supernatants were added to the wells of an 
ELISA plate (NUNC F96 Cert. Maxisorp) that had been coated with 100 ^l/well of 4 ^ig 
ml" 1 human mAb X509 in PBS. Plates were incubated for 4 hours at 37°C and then 

20 washed 3 times with 0.01% PBS-Tween (PBS-T). Bound MSP- 1 33 protein was detected 
by addition of 100 [il of 1:4000 dilution of mouse mAb G13 for 1 hour at 37°C, followed 
by washing and the addition of 100 \il of 1:1000 dilution of sheep anti-mouse IgG (H+L) 
HRP-conjugated antibody. After incubation for 1 h at 37°C, the plates were washed again 
and HRP was detected by the addition of 100 \xl of freshly prepared substrate solution 

25 (400mg r 1 o-phenylenediamine dihydrochloride in 0.05 M phosphate buffer, 0.024 M 
citric acid and 0.012% H2O2) at room temperature for 20 min. The reaction was stopped 
by adding 10 ^1 of 1 M sulphuric acid and the absorbance of each sample was measured at 
492 run. 

30 Results 

The results are shown in Figure 13. The two modified proteins produced antibodies that 
reacted with MSP-1 in the parasite-infected erythrocyte, with a serum titre of 1 : 10,000, 
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which was an identical titre to that of a serum produced in the same way by immunisation 
with a recombinant protein containing the wildtype MSP-1 sequence. This indicates that 
the modified proteins can produce antibodies that react with the native protein. The 
antibodies induced by immunization were able to partially inhibit processing at the 
5 concentration used in a preliminary experiment, whereas in the control serum no 
antibodies that inhibited processing were present. 

Example 5: Design and Synthesis of a Plasmodium Falciparum Merozoite Surface 
Protein-1 Gene Fragment Optimized for Pichia Pastoris Heterologous Expression 

10 

The coding sequence of the Plasmodium falciparum merozoite surface protein-1 (MSP1) 
41.1 kDa processed fragment (MSP- 142) has been redesigned for optimal heterologous 
expression in the yeast Pichia pastoris. The optimized DNA sequence was synthesized by 
PCR gene assembly, in the form of two fragments, MSP- 133 and MSP-1 19. P. pastoris 
15 was transformed with an expression vector containing the optimized MSP-1 19 construct. 
Recombinant strains were shown to express high levels of non-glycosylated, properly 
folded MSP- 1 1 9 protein. 



Proteins encoded by the AT-rich genome of the human malaria parasite Plasmodium 
20 falciparum are generally poorly expressed in heterologous systems (Withers-Martinez et 
al., 1999). The methylotrophic yeast Pichia (Komagataella) pastoris is an appropriate 
system for expression of disulphide-bridged proteins such as the C-terminal fragment of 
the P. falciparum merozoite surface protein-1 (MSP1) (White et al., 1994; Morgan et al., 
1999). In the P. pastoris expression system, it is important to avoid premature 
25 transcription termination due to AT-rich stretches (Romanos et al., 1991). Codon 
preferences for highly expressed genes in P. pastoris have been identified (Sreekrishna et 
al., EP 0 586 892 Al). Therefore, a synthetic MSP-142 gene fragment with codon usage 
optimized for P. pastoris expression was designed, using novel computer software 
(Withers-Martinez et al., 1999). It has previously been shown that the MSP-1 19 fragment 
30 is partially glycosylated when expressed in P. pastoris, and the carbohydrate must be 
enzymatically removed during purification (Morgan et al., 1999). Therefore, two specific 
point mutations were introduced into the synthetic MSP-142 protein sequence in order to 
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prevent N-linked glycosylation at NxS/T sites (one potential site within the MSP- 133 
sequence, and a known site within the MSP-1 19 sequence at Asn 1). 

The optimized MSP- 142 sequence was synthesized by gene assembly polymerase chain 
5 reaction (Stemmer et al., 1995, Withers- Martinez et al., 1999), in the form of separate 
MSP-133 and MSP-1 19 fragments. The .optimized MSP-1 19 fragment was subcloned into 
a novel modified Pichia expression vector, transformed into the P. pastoris host strain 
SMD1168, and several independent transformants were isolated. There transformants 
were shown to efficiently express non-glycosylated, properly folded MSP-1 19. Strong 
10 expression of the optimized gene was observed in low copy number transformants. A 
multiple copy transformant with intermediate level G418 resistance gave expression of 
purified MSP-1 19 at a level equal to the high-expressing strain previously described 
(Morgan et al., 1999), which contains the original P. falciparum DNA. Thus, it should be 
possible to obtain even higher yields from high level G41 8-resistant transformants of the 
15 synthetic optimized gene. 

Methods: Gene Assembly 

The R falciparum MSP- 142 (41.1 kDa) fragment protein sequence SWISS-PROT 
20 accession number P04933: positions 1264-1621) was first altered to eliminate N-linked 
glycosylation signals by 2 amino acid substitutions. The sequences NYT (in the N- 
terminal portion; position 1445) and NIS (at the beginning of the C-terminal fragment; 
position 1526) were changed to QYT and NLA respectively. The protein sequence was 
then reverse-translated with DNA-STAR using the 5. cerevisiae codon preferences. This 
25 sequence was used as input for the CODOP program (Withers-Martinez et al., 1999). Ten 
random sequences were generated with this program, using a codon weighting table 
(Figure 14) derived from codon usage in highly expressed P. pastoris genes (Sreekrishna 
et al., EP 0 586 892 Al). Thus, the codon table should reflect usage in highly expressed 
genes, rather than average usage. The random sequence that contained the minimum 
30 number of unfavourable codons (6) was selected, and these codons were changed 
manually to more preferred alternatives. The sequence was then analysed with DNA- 
STAR to check for AT-rich sequences that may cause transcription termination, and for 
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direct and inverted repeats. A set of 50 overlapping oligonucleotides coding for the final 
sequence was then generated. This consisted of 49 oligonucleotides of length 42 nt, and 
one of length 48 nt. Each oligonucleotide had a 21 bp overlap with its neighbours, with no 
gaps. Estimated T m s were in the range of 60°C to 77°C. Oligonucleotides were synthesised 
5 by Oswel (Southampton, UK) at 40 nmol scale, and supplied in deionised water without 
purification. Outside primers of various lengths for the amplification step were also 
synthesised, to give a T m of 62°C to 64°C, and contained a 5'-terminal phosphate group 
for ligation following the amplification step. The reverse primers also included a 
translation termination codon (UAA in the complementary strand). All oligonucleotides 
10 were diluted to 10 (aM in ddH 2 0 before use. 

The PCR-mediated gene assembly and amplification were carried out as described 
(Stemmer et a/., 1995; Withers-Martinez et ah, 1999), using a Biometra cycler, in thin- 
walled 200 jiL tubes, under the following conditions. 

15 

Gene assembly reactions (Reaction 1): 

50 nL volume 

2 units Vent polymerase (New England Biolabs) 
20 0.4 mM dNTPs 

1 x Vent polymerase buffer 

Oligonucleotide mix containing each oligonucleotide at 200 nM 

Cycles: 

25 32 cycles (2 h 33 m) 

denaturation 94°C 30 s 
annealing 52°C 30 s 
extension 72°C 3 m 

30 Three fragments of the MSP- 142 (41.1 kDa) region were synthesised separately with 
different outside primers and subsets of the 50 oligonucleotide set: 
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N-terminal fragment (bp 1-423) 21 oligonucleotides 
middle fragment (bp 337-786) 22 oligonucleotides 
C-temiinal fragment (bp 787-1074) 14 oligonucleotides 

5 The C-terminal fragment produces a 10.6 kDa fragment (MSP- 1 19). The N-terminal and 
middle fragments, which overlap between positions 337 and 423, were subsequently 
spliced together at the Bglll site (371-376) to give a 786 bp fragment that encodes the 30.5 
kDa MSP- 133 protein. 

10 Amplification reactions (Reaction 2): 

1 00 \xL volume 

10 faL aliquot of the gene assembly reaction 
4 units Vent polymerase 
15 0.4mMdNTPs 

1 x Vent polymerase buffer 
1 |iM outside primers 

Cycles: 

20 32 cycles (2 h 55 m) 

denaturation 94°C 45 s 
annealing 52°C 45 s 
extension 72°C 3 m 
final extension 72°C 5 m 

25 

The PCR products were then purified by filtration with Centricon-100 units (Amicon), 
and cloned directly into the vectors by blunt-end ligation overnight at 16°C with T4 DNA 
ligase. 

30 The synthetic MSP-1 19 gene was cloned directly into a P. pastoris expression vector. The 
modified pPIC9KHXa vector, containing a His 6 tag and factor Xa cleavage site (see 
Figure 15) inserted in the pPIC9K SnaBI site, had been digested with Pmll and treated 
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with calf alkaline phosphatase. The HXa vector had been previously created by insertion 
of a 36 bp synthetic oligonucleotide, containing the His 6 tag, factor Xa cleavage site, and 
Pmll restriction site into the SnaBI site of the pPIC9K vector. 

The N-terminal and middle fragment PCR products were cloned into the Smal site of the 
dephosphorylated pUC118 vector. Plasmid clones containing inserts were sequenced. 
Clones with the correct synthetic sequence were then digested and the two fragments were 
gel-purified. The N-terminal fragment clones were digested with EcoRI and Bglll, and the 
middle fragment clones were digested with Hindlll and Bglll. The recombinant fragments 
were purified on an agarose gel, and eluted with a QIAGEN extraction kit. The purified 
N-terminal and middle fragments were then spliced together by ligation into a pUC118 
vector that had been digested with Hindlll and EcoRI and treated with calf alkaline 
phosphatase. This created the complete synthetic MSP- 133 coding sequence. The N- 
terminal and middle fragment PCR products were cloned into the Smal site of the 
dephosphorylated pUC118 vector. Plasmid clones containing inserts were sequenced. 
Clones with the correct synthetic sequence were then digested and the two fragments were 
gel-purified. The N-terminal fragment clones were digested with EcoRI and Bglll, and the 
middle fragment clones were digested with Hindlll and Bglll. The recombinant fragments 
were purified on an agarose gel, and eluted with a QIAGEN extraction kit. The purified 
N-terminal and middle fragments were then spliced together by ligation into a pUC118 
vector that had been digested with Hindlll and EcoRI and treated with calf alkaline 
phosphatase. This created the complete synthetic MSP- 133 coding sequence. 

Methods: Expression and Purification 

25 

The methylotrophic yeast Pichia (Komagataella) pastoris strain SMD1168 was 
transformed by electroporation as described previously (Morgan et al., 1999). In addition, 
some G418-resistant clones were isolated using Hybond-N+ membranes (Fairlie et al., 
1999). 

30 

Expression screening of transformants was performed by growing 10 ml cultures in 
buffered minimal glucose medium. Cells were harvested and resuspended in 10 ml 
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buffered minimal methanol medium at 1.0 OD 6 oo and grown overnight to a final OD 6 oo of 
2.5 to 3.0. Cells were removed by centrifiigation, and 1.2 ml of the supernatant medium 
was precipitated 30 min on ice with 15 % trichloroacetic acid. The samples were 
centrifuged for 30 min at 14000 rpm at 4 °C in a microfuge, and the protein pellets were 
5 washed twice with cold acetone. Samples were resuspended in 12 jj.1 ddH 2 0 3 and 5 yil was 
electrophoresed, after reduction with DTT, on NOVEX pre-poured acrylamide gels 
according to manufacturer's instructions. NOVEX 4-12 % acrylamide gradient, or 10 % 
acrylamide, Bis/Tris gels in MES buffer were used. Protein gels were stained with 
Coomassie colloidal Brilliant Blue stain (Sigma). 

10 

Homogeneously purified MSP-119 was obtained as described previously (Morgan et al., 
1999), except that enzymatic deglycosylation was omitted for the synthetic gene products. 

Methods: NMR 

15 

One-dimensional ! H- and 2-dimensional ^H/^NJ-HSQC spectra were acquired as 
described previously (Morgan et al, 1999), at 25 °C, at sample concentrations of 1.1-2.5 
mM. 

20 Results 

The sequences of the synthetic DNA fragments, and the resulting predicted protein 
products, are shown in Figure 15. A summary of the resulting improvements to the 
sequence is given in Table 3. 



MSOOCID: <WO 0063245A3_IA> 



WO 00/063245 



-95- 



PCT/G B00/0 1558 



Total codons P. pastoris preferred Unfavourable %ATconte 
codons codons 



P. falciparum MSP1 358 140 28 74 

41.1 kDa fragment 

Synthetic 41.1 kDa 358 276 0 58 

fragment 

Table 3. Codon usage 

5 

PCR-gene assembly reactions for the MSP- 133 (two sections) and MSP-119 synthetic 
fragments are shown on agarose gels in Figure 16. This demonstrated that a single, correct 
size major product was observed in each case. The PCR products were subcloned, 
screened, and sequenced as described in the Methods section. 

10 

P. pastoris was transformed with the synthetic MSP-119 construct in the modified 
pPIC9K expression vector (pPIC9K-HXa; Figure 15). Expression of the synthetic MSP- 
119 product in three independent transformants is shown on a protein gel in Figure 17. 
The protein samples were prepared by trichloroacetic acid precipitation from culture 

15 supernatants as described in the Methods section. This demonstrated that a single, major 
product was present in each sample, corresponding to the expected migration of the 
synthetic MSP-1 19 protein. This migrated slightly more slowly than the controljample, 
which as described previously (Morgan et al., 1999) has a shorter N-terminal tag 
sequence. There was no trace of heterogeneous, slowly migrating recombinant protein that 

20 would result from glycosylation. Therefore, non-glycosylated, synthetic MSP-119 is 
efficiently expressed by the transformed yeast. The yield (measured by UV absorbance) 
of purified MSP-119 was 16 mg/L for low copy number transformants (resistant to 0.25 
mg/ml G418), and increased to 24 mg/L for intermediate G418 resistance (resistant to 1.0 
mg/ml G418). This can be compared with yields of 1-2 mg/L for low copy number 

25 transformants of P. pastoris with the original Plasmodium falciparum coding sequence, 
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before isolation of a highly G418-resistant strain (Morgan et al., 1999). This indicated 
that the synthetic MSP- 1 19 construct is advantageous for recombinant protein expression, 
and that further improvement would result from isolation of higher copy number 
transformants. 

5 

One-dimensional proton NMR experiments demonstrated that the synthetic MSP- 119 
protein spectrum was very similar to the previously studied protein (Morgan et al., 1999), 
and represented a correctly folded protein (data not shown). This was further confirmed 
by a 2D-{ l H/ i5 N}-HSQC spectra (Figure 18), which also shows that the structure of the 

10 synthetic product is identical to the previously studied protein, except for slight 
differences at the N-terminus which are consistent with the presence of a distinct N- 
terminal tag sequence, and S3->A mutation at the glycosylation site. Backbone NH 
proton and I5 N chemical shifts for the original P. falciparum sequence product have been 
previously presented (Morgan et al., 1999). The similarity between the two spectra, 

15 outside of the N-terminal region, is strong evidence that both protein forms are in a 
structurally similar, correctly folded state. 
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CLAIMS 

1. A non-naturally occurring variant of a C-terminal fragment of a Plasmodium 
merozoite surface protein- 1 (MSP-1) wherein said variant has (i) a reduced affinity, 

5 compared with a naturally occurring Plasmodium MSP-1 19, for at least one first antibody 
capable of blocking the binding of a second antibody, which second antibody inhibits the 
proteolytic cleavage of Plasmodium MSP-1 42 and (ii) substantially the same affinity for at 
least one third antibody compared with said naturally occurring Plasmodium MSP-1 19. 
which third antibody inhibits the proteolytic cleavage of Plasmodium MSP-I42. 

10 

2. A variant according to claim 1 wherein said Plasmodium MSP-1 19 and MSP-I42 
are Plasmodium falciparum MSP-1 19 and MSP-1 42- 

3. A variant according to claim 1 or 2 wherein the first antibody is selected from 
15 mAbs lEl,2.2,7.5,9C8and 111 A 

4. A variant according to claim 1 or 2 wherein the second and/or third antibody is 
selected from mAbs 12.8, 12.10 and 5B1. 

20 5. A non-naturally occurring variant of a C-terminal fragment of a Plasmodium 
merozoite surface protein- 1 (MSP-1) comprising an amino acid modification at any one of 
amino acid residues 14, 15, 27, 31, 34 , 43, 48 and 53 of the Plasmodium falciparum 
MSP-1 19 amino acid sequence shown as SEQ I.D. No. 1 or their equivalent positions in 
other Plasmodium MSP-1 19 polypeptides. 

25 

6. A variant according to claim 5 wherein said modifications are selected from 
G lnl4-»Arg, Glnl4->Gly, Asnl 5->Arg, Glu27-»Tyr, Leu3 1 -*Arg, Tyr34->Ser, 
Tyr34-»Ile, Glu43->Leu, Thr48->Lys and Asn53-»Arg and their equivalents in other 
Plasmodium MSP-1 19 polypeptides. 

30 

7. A variant according to claim 6 wherein said substitutions are selected from 
[Glu27->Tyr, Leu31->Arg and Glu43-*Leu], [Glu27-»Tyr, Leu31->Arg, Tyr34-»Ser 
and Glu43-»Leu], [Asnl5->Arg, Glu27->Tyr, Leu31->Arg and Glu43->Leu] and their 
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equivalents in other Plasmodium MSP-1 19 polypeptides. 

8. A variant according to any one of claims 5 to 7 further comprising a mutation at 
Cysl2 and/or Cys28 of the Plasmodium falciparum MSP- 1 19 amino acid sequence shown 

5 as SEQ LD. No. 1 . 

9. A variant according to claim 8, in which the mutation is a substitution selected 
from Cysl2-+Ile and Cys28^Trp, and Cysl2-+Ala and Cys28-+Phe. 

10 10. A variant according to claim 9, in which the substitutions are selected from 
[Cysl2^Ile, Asn 15^Arg, Glu27— Tyr, Cys28^Trp, Leu31— Arg, Glu43^Leu], 
[Cysl2— He, Asn 15-^Arg, Glu27^Tyr, Cys28^Trp, Leu31-»Arg, Glu43—Leu, 
Asn53^Arg], [Cysl2-»Ile, Asn 15^Arg, Glu27-+Tyr, Cys28^Trp, Leu31-*Arg, 
Tyr34— >Ser, Glu43— >Leu, Asn53— »Arg] and their equivalents in other PlasmodiumMSP- 

15 1 19 polypeptides. 

11. A method for producing a Plasmodium MSP-1 variant for use in preparing a 
vaccine composition which method comprises modifying one or more amino acid residues 
of a Plasmodium MSP-1 C-terminal fragment such that the resulting derivative has (i) a 

20 reduced affinity, compared with a naturally occurring Plasmodium MSP- 1 19, for at least 
one first antibody capable of blocking the binding of a second antibody, which second 
antibody inhibits the proteolytic cleavage of Plasmodium MSP- 1 42 and (ii) substantially 
the same affinity for at least one third antibody compared with said naturally occurring 
Plasmodium MSP-1 19, which third antibody inhibits the proteolytic cleavage of 

25 Plasmodium MSP- 1 42. 

12. A non-naturally occurring Plasmodium MSP-1 variant obtained by the method of 
claim 1 L 

30 13. A polynucleotide encoding a variant according to any one of claims 1 to 10 or 12 
operably linked to a regulatory sequence capable of directing the expression of said 
nucleotide in a host cell. 
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14. 14. A polynucleotide according to claim 13 having a sequence optimised for 
expression in said host cell. 

15. A polynucleotide according to claim 13 or 14, in which the host cell is a Pichia 
5 pastoris cell. 

16. A nucleic acid vector comprising a polynucleotide according to claim 13, 14 or 15. 

1 7. A host cell comprising a vector according to claim 1 6. 

10 

18. A pharmaceutical composition comprising a variant according to any one of claims 
1 to 10 or 12, a polynucleotide according to any one of claims 13 to 15 or a vector 
according to claim 16 together with a pharmaceutically acceptable carrier or diluent. 

15 19. A composition according to claim 18 further comprising an immunogenic 
Plasmodium polypeptide or fragment or derivative thereof. 

20. A method for producing anti-MSP-l antibodies which method comprises 
administering a polypeptide according to any one of claims 1 to 10 or 12, a polynucleotide 

20 according to any one of claims 13 to 15 or a vector according to claim 16 to a mammal. 

21. A method for producing polyclonal anti-MSP-l antibodies which method 
comprises administering a polypeptide according to any one of claims 1 to 10 or 12, a 
polynucleotide according to any one of claims 13 to 15 or a vector according to cjaim 16 

25 to a mammal and extracting the serum from said mammal. 

22. An antibody produced by the method of claim 20 or 2 1 . 

23. A method of inducing immunity against malaria induced by Plasmodium 
30 falciparum which comprises administering to a person in need of such immunity an 

effective amount of the variant of claim 1, the polynucleotide of claim 13 or the vector of 
claim 16. 
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24. A method of immunizing a mammal, said method comprising administering an 
effective amount the polypeptide of claim 1, the polynucleotide of claim 13 or the vector 
of claim 16. 

5 

25. The method of claim 24, wherein said mammal is immunized against malaria. 

26. A method of treating a malaria infection in a human patient which comprises 
administering to the patient an effective amount of the pharmaceutical composition of 

10 claim 18 or 19. 

27. A variant according to any one of claims 1 to 10 or 12, a polynucleotide according 
to any one of claims 13 to 15 or a vector according to claim 16 for use in therapy. 

15 28. A computer readable medium having stored thereon a model of the MSP-1 19 NMR 
structure. 

29. A nucleic acid encoding a Plasmodium MSP-1 polypeptide, in which the nucleic 
acid is optimised for expression in a heterologous host cell. 

20 

30. A nucleic acid according to Claim 29, in which the heterologous host is a Pichia 
pastoris cell. 

31. A nucleic acid according to Claim 29 or 30, in which the polypeptide is selected 
25 from an MSP-1 42 polypeptide comprising a sequence shown in Figures 2C and 2E, an 

MSP-1 19 polypeptide comprising a sequence shown in Figure 2C, and an MSP-1 33 
polypeptide comprising a sequence shown in Figure 2E. 

32. A nucleic acid according to any of Claims 29 to 31, in which the nucleic acid 
30 comprises a sequence selected from the sequences of Figure 2A, Figure 2B and Figure 2D. 
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33. A nucleic acid vector comprising a nucleic acid according to any of Claims 29 to 

5 34. A host cell comprising a vector according to claim 33 . 

35. A pharmaceutical composition comprising a nucleic acid according to any of 
Claims 29 to 31 or a vector according to Claim 33 together with a pharmaceutical^ 
acceptable carrier or diluent, 

10 

36. A composition according to Claim 35 further comprising an immunogenic 
Plasmodium polypeptide or fragment or derivative thereof. 
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U Codon Usage Table for Komagataella (Pichia) pastoris 

# for use with CODOP program 

# Fields: [amino acid] [triplet] [frequency per thousand] [number] 

# codon preferences weights: 



# 


30 


preferred uniquely 


n 


20 


preferred alternative 


n 


10 


second best alternative 


n 


15 


two equally preferred codons 


n 


1 


unclassified 


u 


0 


forbidden codons 


n- 

# codon selection list 


# 







and not ATA 
and not GTA 
and not CCG 
and not GCG 



# Phe TTC 

# Leu TTG > CTG 
U He ATC = ATT 

# Met ATG 

# Val GTT > GTC 
U Ser TCT = TCC 

# Pro CCA > CCT 

# Thr ACT = ACC 
U Ala GCT > GCC 
#Tyr TAC and not TAT 
U * UAA 

# His CAC > CAT 

# Gin CAA > CAG 

# Asn AAC 

# Lys A AG 

U Asp GAC > GAT 

n Glu GAA = GAG 

U Cys TGT > TGC 

# Trp TGG 

#Arg AGA and not CGC , CGA , CGG 

#Gly GGT > GGA and not GGG 



PheUUU 1.0( 
Phe UUC 30.0( 
LeuUUA 1.0( 
Leu UUG 20.0( 

LeuCUU 1.0( 
Leu CUC l.0( 
LeuCUA L0( 
Leu CUG 10.0( 

lie AUU 15.0( 
He AUC 15.0( 
IleAUA 0.0( 
Met AUG 30.0( 

Val GUU 20.0( 
Val GUC 10.0( 
Val GUA 0.0( 
ValGUG 1.0{ 



1) 


Ser UCU 15.0( 


30) 


Ser UCC 15.0( 


1) 


Ser UCA 1.0( 


20) 


Ser UCG 1.0( 


0 


Pro CCU 10.0( 


0 


Pro CCC 1.0(- 


1) 


Pro CCA 20.0( 


10) 


Pro CCG 0.0( 


15) 


Thr ACU 15.0( 


15) 


Thr ACC 15.0( 


0) 


Thr ACA 1.0( 


30) 


Thr ACG 1.0( 


20) 


Ala GCU20.0( 


10) 


Ala GCC 10.0( 


0) 


Ala GCA 1.0( 


1) 


Ala GCG 0.0( 



15) Tyr UAU 0.0( 
15) Tyr UAC 30.0( 
1) Ter UAA30.0( 
1) Ter UAG 0.0( 

10) His CAU 10.0( 

1) HisCAC20.0( 

20) Gin CAA 20.0( 

0) Gin CAG 10.0( 

15) Asn AAU 1.0( 
15) Asn AAC30.0( 

1) Lys AAA 1.0( 
1) LysAAG30.0( 

20) Asp GAU 10.0( 
10) Asp GAC20.0( 
1) Glu GAA 15.0( 
0) Glu GAG 15.0( 

FIG. 14 



0) 


Cys UGU 20.0( 


20) 


30) Cys UGC 10.0( 


10) 


30) 


Ter UGA 0.0( 


0) 


0) 


Trp UGG 30.0( 


30) 


10) 


Arg CGU 1.0( 


1) 


20) 


Arg CGC 0.0( 


0) 


20) 


Arg CGA 0.0( 


0) 


10) 


Arg CGG 0.0( 


0) 


1) 


Ser AGU 1.0( 


1) 


30) 


Ser AGC 1.0( 


1) 


1) 


Arg AGA 30.0( 


30) 


30) Arg AGG 1.0( 


1) 


10) Gly GGU20.0( 


20) 


20) Gly GGC 1.0( 


1) 


15) 


Gly GGA 10.0( 


10) 


15) 


Gly GGG 0.0( 


0) 
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